PCT

WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(51) International Patent Classification 6: C12Q 1/68    A1

(11) International Publication Number: WO 97/33000

(43) International Publication Date: 12 September 1997 (12.09.97)

(21) International Application Number: PCT/US97/03499

(22) International Filing Date: 4 March 1997 (04.03.97)



(30) Priority Data:
6(V012JS2 



4 March 1996 (04.03.96)



US 



(71) Applicant: GENETRACE SYSTEMS, INC. [US/US]; 333
Ravenswood Avenue, Menlo Park, CA 94025 (US).

(72) Inventors: MONFORTE, Joseph, A.; 50 AUikio Avenue,
Berkeley, CA 94708 (US). SHALER, Thomas, A.; 1384 4th
Avenue #3, San Francisco, CA 94122 (US). TAN, Yuping;
830 Coleman Avenue #16, Menlo Park, CA 94025 (US).
BECKER, Christopher, R.; 106 Dover Lane, Menlo Park,
CA 94fm (US).

(74) Agents: NAKAMURA, Jackie, N. et al.; Cooley Godward
LLP, Five Palo Alto Square, 3000 El Camino Real, Palo
Alto, CA 94306-2155 (US).



(81) Designated States: AU, CA, CN, IL, JP, KR, European patent
(AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU,
MC, NL, PT, SE).



Published

With international search report.
Before the expiration of the time limit for amending the
claims and to be republished in the event of the receipt of
amendments.



(54) Title: METHODS OF SCREENING NUCLEIC ACIDS USING MASS SPECTROMETRY

(57) Abstract

This invention relates to methods for screening
nucleic acids for mutations by analyzing nonrandomly
fragmented nucleic acids using mass spectrometric
techniques and to procedures for improving mass resolution
and mass accuracy of these methods of detecting
mutations.



[Figure: schematic of the screening method. Legible labels: Source Nucleic Acid; PCR amplify; Target Nucleic Acid; mass spectra plotted against a mass axis. The remaining labels are illegible in the source scan.]


METHODS OF SCREENING NUCLEIC ACIDS USING MASS
SPECTROMETRY



ACKNOWLEDGEMENTS 
This invention was supported in part by a Financial Assistance Award from the 
United States Department of Commerce, Advanced Technology Program,
Cooperative Agreement #70NANB5H1029. The U.S. Government may have 
rights in this invention. 

Technical Field 

This invention relates generally to methods for screening nucleic acids for
mutations by analyzing fragmented nucleic acids using mass spectrometry.

INTRODUCTION 

Approximately 4,000 human disorders are attributed to genetic causes. 
Hundreds of genes responsible for various disorders have been mapped, and 
sequence information is being accumulated rapidly. A principal goal of the
Human Genome Project is to find all genes associated with each disorder. The
definitive diagnostic test for any specific genetic disease (or predisposition to
disease) will be the identification of mutations in affected cells that result in
alterations of gene function. Furthermore, response to specific medications may
depend on the presence of mutations. Developing DNA (or RNA) screening as a
practical tool for medical diagnostics requires a method that is inexpensive, 
accurate, expeditious, and robust. 

Genetic mutations can manifest themselves in several forms, such as point
mutations where a single base is changed to one of the three other bases,
deletions where one or more bases are removed from a nucleic acid sequence and
the bases flanking the deleted sequence are directly linked to each other, and
insertions where new bases are inserted at a particular point in a nucleic acid
sequence adding additional length to the overall sequence. Large insertions and
deletions, often the result of chromosomal recombination and rearrangement
events, can lead to partial or complete loss of a gene. Of these forms of mutation,
in general the most difficult type of mutation to screen for and detect is the point
mutation because it represents the smallest degree of molecular change. The term
mutation encompasses all the above-listed types of differences from wild type
nucleic acid sequence. Wild type is a standard or reference nucleotide sequence to
which variations are compared. As defined, any variation from wild type is
considered a mutation, including naturally occurring sequence polymorphisms.

Although a number of genetic defects can be linked to a specific single
point mutation within a gene, e.g. sickle cell anemia, many are caused by a wide
spectrum of different mutations throughout the gene. A typical gene that might be
screened using the methods described here could be anywhere from 1,000 to
100,000 bases in length, though smaller and larger genes do exist. Of that amount
of DNA, only a fraction of the base pairs actually encode the protein. These
discontinuous protein coding regions are called exons and the remainder of the
gene is referred to as introns. Of these two types of regions, exons often contain
the most important sequences to be screened. Several complex procedures have
been developed for scanning genes in order to detect mutations, which are
applicable to both exons and introns.

Gel Electrophoresis: Several of the procedures described below use some form of
gel electrophoresis. Therefore it is worthwhile to briefly consider this separation
technology before proceeding to the specific methods. In terms of current use,
most of the methods to scan or screen genes employ slab or capillary gel
electrophoresis for the separation and detection step in the assays. Gel
electrophoresis of nucleic acids primarily provides relative size information based
on mobility through the gel matrix. If calibration standards are employed, gel
electrophoresis can be used to measure absolute and relative molecular weights of
large biomolecules with some moderate degree of accuracy; even then typically the
accuracy is only 5% to 10%. Also the molecular weight resolution is limited. In
cases where two DNA fragments with an identical number of base pairs can be
separated, using high concentration polyacrylamide gels, it is still not possible to
identify which band on a gel corresponds to which DNA fragment without
performing secondary labeling experiments. Gel electrophoresis techniques can
only determine size and cannot provide any information about changes in base
composition or sequence without performing more complex sequencing reactions.
Gel-based techniques, for the most part, are dependent on labeling methods to
visualize and discriminate between different nucleic acid fragments.

DNA Sequencing: The principal approach currently used to screen for genetic
mutations is DNA sequencing. Sequencing reactions can be performed to screen
the full genetic target base by base. This process, which can pinpoint the exact
location and nature of mutation, requires labeling DNA, use of polyacrylamide
gels, and a multiplicity of reactions to assess all bases over the length of a gene,
all of which are slow and labor intensive procedures. [J. Bergh et al., "Complete
Sequencing of the p53 Gene Provides Prognostic Information in Breast Cancer
Patients, Particularly in Relation to Adjuvant Systemic Therapy and
Radiotherapy," Nature Medicine 1, 1029 (1995)]

For DNA sequencing, nucleic acids comprising different exons or small clusters of
exons are individually amplified, often using polymerase chain reaction (PCR).
The amplifications are normally performed separately although some multiplexing
of reactions is possible. The amplified nucleic acids typically range from one
hundred to several thousand bases in length. Following amplification, the PCR
products can serve as templates for standard dideoxy-based Sanger sequencing
reactions. The four different sequencing reactions are run (or for fluorescence
detection, one reaction with four different dye terminators) and then analyzed by
polyacrylamide gel electrophoresis. Each sequencing run yields about 300 to 600
bases of sequence which typically must be read with at least a two- to three-fold
redundancy in order to assure accuracy. Using slab gel, the analysis process
typically takes several hours.




SSCP: The single strand conformational polymorphism assay takes advantage of
structural variation within DNA that results from mutation. The method involves
folding the single-stranded form of a given nucleic acid sequence into a
thermodynamically directed secondary and tertiary structure. In most cases,
mutated sequences form different structures than the wild type sequence, thus
permitting separation of mutated and wild type sequences by gel electrophoresis.
Like sequencing, this assay is complicated by the need to label molecules and run
polyacrylamide gels. In a typical case, mutations can be located within a general
range of 50 to 200 base pairs, but the exact nature of the mutation cannot be
identified. [M. Orita et al., "Detection of Polymorphisms of Human DNA by Gel
Electrophoresis as Single-Strand Conformation Polymorphisms," Proc. Natl.
Acad. Sci. USA 86, 2766 (1989)]

DGGE: Like SSCP, denaturing gradient gel electrophoresis assays also
differentiate based on structural variation, but require the use of gradient gels,
which are difficult to prepare. The different thermodynamic stability of structures
formed by the mutant sequence, as opposed to wild type, leads to differences in the
temperature and/or pH at which the molecule will denature. DGGE mutation
identification and localization properties are similar to those for SSCP, though
sensitivity is higher for DGGE because not all mutations cause the structural
changes that the SSCP method depends upon for detection. [E.S. Abrams, S.E.
Murdaugh & L.S. Lerman, "Comprehensive Detection of Single Base Changes in
Human Genomic DNA Using Denaturing Gradient Gel Electrophoresis and a GC
Clamp," Genomics 7, 463 (1990)]

EMC: Enzyme mismatch cleavage utilizes one or more enzymes that are capable
of recognizing interruptions in base pairing within a double-stranded nucleic acid
molecule, e.g. base-base mismatches, bulges, or internal loops. A given length of
DNA or RNA is prepared in heterozygous form, with one strand composed of
wild type nucleic acid and the other strand containing a potential mutation. At the
specific site where the mutation forms a mismatch with the wild type sequence, a
structural perturbation occurs. An enzyme such as T4 endonuclease VII, RuvC,
RNase A, or MutY can recognize such a structural perturbation and can
site-specifically cut the double-stranded nucleic acid, creating smaller molecules whose
sizes indicate the presence and location of the mutation. As with the previously
discussed methods, this approach, as currently used, also requires labeling and gel
electrophoresis. With this method, the site of mutation can be localized to within
a few base pairs but the exact nature of the mutation cannot be determined. [R.
Youil, B.W. Kemper & R.G.H. Cotton, "Screening for Mutations by Enzyme
Mismatch Cleavage with T4 Endonuclease VII," Proc. Natl. Acad. Sci. USA 92,
87 (1995)]

CCM: A variation of EMC is to replace the enzymatic cleavage step with chemical
cleavage. Chemical cleavage mismatch analysis involves the use of reagents such
as osmium tetroxide to react with mismatched thymine residues or hydroxylamine
to react with mismatched cytosine residues. Cleavage of the modified mismatched
residues occurs when the modified bases are subsequently treated with piperidine
or another oxidizing agent. The effectiveness of the method is similar to EMC.
[J.A. Saleeba & R.G.H. Cotton, "Chemical Cleavage of Mismatch to Detect
Mutations," Methods in Enzymology 217, 286 (1993)]

Hybridization Arrays: Several approaches to screening for mutations involve the
probing of a target nucleic acid by an array of oligonucleotides that can
differentiate between normal wild type nucleic acids and mutant nucleic acids.
These arrays involve the performance of hundreds or thousands of hybridization
reactions in parallel with different site-directed oligonucleotides and require
sophisticated and costly probe arrays. Hybridization arrays can identify the
location and type of mutation in many, but not all, cases. For example,
semihomologous sequential insertions or targets with repeating sequences and/or
repeating sequential motifs cannot be analyzed by hybridization. [A.C. Pease et
al., "Light-Generated Oligonucleotide Arrays for Rapid DNA Sequence Analysis,"
Proc. Natl. Acad. Sci. USA 91, 5022 (1994)]




Simple screens: For mutations localized within a given gene, such as the cystic
fibrosis ΔF508 deletion, it is also possible to perform a single PCR or ligase chain
reaction (LCR) assay or simple hybridization assays tailored to these specific sites.
PCR and LCR results are presently determined by the use of labeled molecules,
where radioactive emissions, fluorescence, chemiluminescence or color changes
are detected directly. These simple screens amount to a yes/no answer and do not
directly identify the nature of the mutation, only whether or not a reaction took
place. [P. Fang et al., "Simultaneous Analysis of Mutant and Normal Alleles for
Multiple Cystic Fibrosis Mutations by the Ligase Chain Reaction," Human
Mutation 144 (1995)]

All of the methods in use today capable of screening broadly for genetic
mutations suffer from technical complication and are labor and time intensive.
There is a need for new methods that can provide cost effective and expeditious
means for screening genetic material in an effort to reduce medical expenses. The
inventions described here address these issues by developing novel, tailor-made
processes that focus on the use of mass spectrometry as a genetic analysis tool.
Mass spectrometry requires minute samples, provides extremely detailed
information about the molecules being analyzed including high mass accuracy, and
is easily automated.

The late 1980's saw the rise of two new mass spectrometric techniques for
successfully measuring the masses of intact very large biomolecules, namely,
matrix-assisted laser desorption/ionization (MALDI) time-of-flight mass
spectrometry (TOF MS) [K. Tanaka et al., "Protein and Polymer Analyses up to
m/z 100,000 by Laser Ionization Time-of-flight Mass Spectrometry," Rapid
Commun. Mass Spectrom. 2, 151-153 (1988); B. Spengler et al., "Laser Mass
Analysis in Biology," Ber. Bunsenges. Phys. Chem. 93, 396-402 (1989)] and
electrospray ionization (ESI) combined with a variety of mass analyzers [J.B.
Fenn et al., Science 246, 64-71 (1989)]. Both of these methods are suitable
for genetic screening tests. The MALDI mass spectrometric technique can also be
used with methods other than time-of-flight, for example, magnetic sector,
Fourier-transform ion cyclotron resonance, quadrupole, and quadrupole trap.
One of the advances in MALDI analysis of polynucleotides was the discovery of
3-hydroxypicolinic acid as an ideal matrix for mixed-base oligonucleotides. [Wu
et al., Rapid Commun. Mass Spectrom. 7, 142-146 (1993)]

MALDI-TOF MS involves laser pulses focused on a small sample plate
comprising analyte molecules (nucleic acids) embedded in either a solid or liquid
matrix comprising a small, highly absorbing compound. The laser pulses transfer
energy to the matrix causing a microscopic ablation and concomitant ionization of
the analyte molecules, producing a gaseous plume of intact, charged nucleic acids
in single-stranded form. If double-stranded nucleic acids are analyzed, the
MALDI-TOF MS typically results in mostly denatured single-strand detection.
The ions generated by the laser pulses are accelerated to a fixed kinetic energy by
a strong electric field and then pass through an electric field-free region in vacuum
in which the ions travel with a velocity corresponding to their respective
mass-to-charge ratios (m/z). The smaller m/z ions will travel through the vacuum region
faster than the larger m/z ions, thereby causing a separation. At the end of the
electric field-free region, the ions collide with a detector that generates a signal as
each set of ions of a particular mass-to-charge ratio strikes the detector. Usually
for a given assay, 10 to 100 mass spectra resulting from individual laser pulses are
summed together to make a single composite mass spectrum with an improved
signal-to-noise ratio.
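
The time-of-flight separation described above can be illustrated numerically. The following is a minimal sketch, assuming an idealized linear instrument in which every ion receives the same kinetic energy zeV and then drifts at constant velocity. The 20 kV accelerating voltage and 1 m drift length are illustrative assumptions and are not values taken from this document; the function name is likewise hypothetical.

```python
import math

ELEMENTARY_CHARGE = 1.602176634e-19  # coulombs
DALTON = 1.66053906660e-27           # kilograms per dalton


def flight_time(mass_da, charge=1, accel_voltage=20_000.0, drift_length=1.0):
    """Return the drift time in seconds of an ion in an idealized linear TOF.

    accel_voltage (volts) and drift_length (meters) are assumed values
    chosen only for illustration.
    """
    kinetic_energy = charge * ELEMENTARY_CHARGE * accel_voltage  # joules
    velocity = math.sqrt(2.0 * kinetic_energy / (mass_da * DALTON))
    return drift_length / velocity


if __name__ == "__main__":
    for mass in (5_000.0, 15_000.0, 30_000.0):
        print(f"{mass:>8.0f} Da -> {flight_time(mass) * 1e6:.1f} microseconds")
```

Because the flight time scales with the square root of m/z, lighter ions arrive first, and quadrupling the mass doubles the flight time.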

The mass of an ion (such as a charged nucleic acid) is measured by using
its velocity to determine the mass-to-charge ratio by time-of-flight analysis. In
other words, the mass of the molecule directly correlates with the time it takes to
travel from the sample plate to the detector. The entire process takes only
microseconds. In an automated apparatus, tens to hundreds of samples can be
analyzed per minute. In addition to speed, MALDI-TOF MS has one of the
largest mass ranges for mass spectrometric devices. The current mass range for
MALDI-TOF MS is from 1 to 1,000,000 Daltons (Da) (measured recently for a
protein). [R.W. Nelson et al., "Detection of Human IgM at m/z ~ 1 MDa,"
Rapid Commun. Mass Spectrom. 9, 625 (1995)]

The performance of a mass spectrometer is measured by its sensitivity,
mass resolution and mass accuracy. Sensitivity is measured by the amount of
material needed; it is generally desirable and possible with mass spectrometry to
work with sample amounts in the femtomole and low picomole range. Mass
resolution, m/Δm, is the measure of an instrument's ability to produce separate
signals from ions of similar mass. Mass resolution is defined as the mass, m, of an
ion signal divided by the full width of the signal, Δm, usually measured between
points of half-maximum intensity. Mass accuracy is the measure of error in
designating a mass to an ion signal. The mass accuracy is defined as the ratio of
the mass assignment error divided by the mass of the ion and can be represented
as a percentage.
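
The two definitions above translate directly into arithmetic. The sketch below is illustrative only; the function names and the example peak (10,000 Da with a 20 Da full width at half maximum) are assumptions introduced here, not values from this document.

```python
def mass_resolution(mass, full_width_half_max):
    """Mass resolution m/Δm, with Δm taken as the full width at half maximum."""
    return mass / full_width_half_max


def mass_accuracy_percent(measured_mass, true_mass):
    """Mass assignment error divided by the ion mass, expressed as a percentage."""
    return abs(measured_mass - true_mass) / true_mass * 100.0


if __name__ == "__main__":
    # Hypothetical peak: 10,000 Da ion with a 20 Da wide signal.
    print(mass_resolution(10_000.0, 20.0))
    # Hypothetical assignment: a 15,000 Da ion reported 6 Da too heavy.
    print(mass_accuracy_percent(15_006.0, 15_000.0))
```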

To be able to detect any point mutation directly by MALDI-TOF mass
spectrometry, one would need to resolve and accurately measure the masses of
nucleic acids in which a single base change has occurred (in comparison to the
wild type nucleic acid). A single base change can be a mass difference of as little
as 9 Da. This value represents the difference between the two bases with the
closest mass values, A and T (A = 2'-deoxyadenosine-5'-phosphate = 313.19 Da;
T = 2'-deoxythymidine-5'-phosphate = 304.20 Da; G = 2'-deoxyguanosine-5'-phosphate
= 329.21 Da; and C = 2'-deoxycytidine-5'-phosphate = 289.19 Da).
If during the mutation process, a single A changes to T or a single T to A, the
mutant nucleic acid containing the base transversion will either decrease or
increase by 9 Da in total mass as compared to the wild type nucleic acid. For mass
spectrometry to directly detect these transversions, it must therefore be able to
detect a minimum mass change, Δm, of approximately 9 Da.
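
The claim that A and T are the closest pair can be checked by tabulating every pairwise difference among the four nucleotide masses quoted above. This is a small illustrative sketch; the variable and function names are introduced here for convenience.

```python
from itertools import combinations

# Nucleotide masses in Da, as quoted in the preceding paragraph.
NUCLEOTIDE_MASS = {"A": 313.19, "T": 304.20, "G": 329.21, "C": 289.19}


def substitution_mass_shifts(masses=NUCLEOTIDE_MASS):
    """Absolute mass change (Da) for each possible single-base substitution."""
    return {
        (a, b): round(abs(masses[a] - masses[b]), 2)
        for a, b in combinations(masses, 2)
    }


if __name__ == "__main__":
    shifts = substitution_mass_shifts()
    for pair, shift in sorted(shifts.items(), key=lambda item: item[1]):
        print(f"{pair[0]} <-> {pair[1]}: {shift:.2f} Da")
```

Sorting by shift lists the A/T exchange first, which is why a detectable mass change of roughly 9 Da sets the requirement for the method.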

For example, in order to fully resolve (which may not be necessary) a
point-mutated (A to T or T to A) heterozygote 50-base single-stranded DNA
fragment having a mass, m, of approximately 15,000 Da from its corresponding wild type
nucleic acid, the required mass resolution is m/Δm = 15,000/9 ≈ 1,700.
However, the mass accuracy needs to be significantly better than 9 Da to increase
quality assurance and to prevent ambiguities where the measured mass value is
near the half-way point between the two theoretical masses. For an analyte of
15,000 Da, in practice the mass accuracy needs to be Δm ≈ ±3 Da = 6 Da. In
this case, the absolute mass accuracy required is (6/15,000) × 100 = 0.04%. Often
a distinguishing level of mass accuracy relative to another known peak in the
spectrum is sufficient to resolve ambiguities. For example, if there is a known
mass peak 1000 Da from the mass peak in question, the relative position of the
unknown to the known peak may be known with greater accuracy than that
provided by an absolute, previous calibration of the mass spectrometer.
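
The worked numbers above, and the ambiguity near the half-way point, can be sketched as follows. The classification rule (accept a peak only if it lies within ±3 Da of exactly one theoretical mass) is an illustrative assumption built on the tolerance discussed in the text, and the function name is hypothetical.

```python
WILD_TYPE_MASS = 15_000.0   # Da, the 50-base example fragment
BASE_CHANGE_SHIFT = 9.0     # Da, A <-> T transversion
MUTANT_MASS = WILD_TYPE_MASS + BASE_CHANGE_SHIFT

required_resolution = WILD_TYPE_MASS / BASE_CHANGE_SHIFT   # about 1,700
required_accuracy_percent = 6.0 / WILD_TYPE_MASS * 100.0   # 0.04 percent


def classify_peak(measured, wild_type=WILD_TYPE_MASS, mutant=MUTANT_MASS,
                  tolerance=3.0):
    """Assign a measured mass to wild type or mutant, or flag it as ambiguous.

    The tolerance of 3 Da is an assumed acceptance window, not a prescribed one.
    """
    near_wild_type = abs(measured - wild_type) <= tolerance
    near_mutant = abs(measured - mutant) <= tolerance
    if near_wild_type and not near_mutant:
        return "wild type"
    if near_mutant and not near_wild_type:
        return "mutant"
    return "ambiguous"


if __name__ == "__main__":
    print(round(required_resolution), required_accuracy_percent)
    for mass in (15_001.0, 15_004.5, 15_008.0):
        print(mass, classify_peak(mass))
```

A measurement falling midway between the two theoretical masses matches neither window and is reported as ambiguous, which is the situation the tighter accuracy requirement is meant to avoid.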

In order for mass spectrometry to be a useful tool for screening for
mutations in nucleic acids, several basic requirements need to be met. First, any
nucleic acids to be analyzed must be purified to the extent that minimizes salt ions
and other molecular contaminants that reduce the intensity and quality of the mass
spectrometric signal to a point where either the signal is undetectable or
unreliable, or the mass accuracy and/or resolution is below the value necessary to
detect single base change mutations. Second, the size of the nucleic acids to be
analyzed must be within the range of the mass spectrometry where there is the
necessary mass resolution and accuracy. Mass accuracy and resolution do
significantly degrade as the mass of the analyte increases; currently this is
especially significant above approximately 30,000 Da for oligonucleotides (~100
bases). Third, because all molecules within a sample are visualized during mass
spectrometric analysis (i.e., it is not possible to selectively label and visualize
certain molecules and not others as one can with gel electrophoresis methods) it is
necessary to partition nucleic acid samples prior to analysis in order to remove
unwanted nucleic acid products from the spectrum. Fourth, the mass
spectrometric methods for generalized nucleic acid screening must be efficient and
cost effective in order to screen a large number of nucleic acid bases in as few
steps as possible.

The methods for detecting nucleic acid mutations known in the art do not
satisfy these four requirements. For example, prior art methods for mass
spectrometric analysis of DNA fragments have focused on double-stranded DNA
fragments which result in complicated mass spectra, making it difficult to resolve
mass differences between two complementary strands. See, e.g., Tang et al.,
Rapid Commun. Mass Spectrom. 8, 183-186 (1994).

Thus, there is a need for cost and time effective methods of detecting
genetic mutations using mass spectrometry, preferably MALDI or ES, without
having to sequence the genetic material and with mass accuracy of a few parts in
10,000 or better.




SUMMARY OF THE INVENTION 

The present invention provides methods of and kits for detecting mutations
in a target nucleic acid comprising nonrandomly fragmenting said target nucleic
acid to form a set of nonrandom length fragments (NLFs), and determining masses of
members of said set of NLFs using mass spectrometry, wherein said determining
does not involve sequencing of said target nucleic acid.

In a preferred embodiment, the method of detecting mutations comprises
obtaining a set of nonrandom length fragments in single-stranded form. The
masses of the members of the set of NLFs can be compared with the known or
predicted masses of a set of NLFs derived from a wild type target nucleic acid that
is the wild type version of the target nucleic acid that is being screened for
mutations. The members of the set of single-stranded NLFs can optionally have
one or more nucleotides replaced with mass-modified nucleotides, including
mass-modified nucleotide analogs. Another optional aspect of the invention is the
inclusion of internal calibrants or internal self-calibrants in the set of nonrandom
length fragments to be analyzed by mass spectrometry to provide improved mass
accuracy.
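
The comparison of measured NLF masses with the predicted wild type masses can be sketched as a simple matching step. This is an illustration under stated assumptions, not the procedure claimed in this document: the fragment names, the example masses, and the ±3 Da matching tolerance are all hypothetical.

```python
def compare_to_wild_type(predicted, measured, tolerance=3.0):
    """Match measured peak masses against predicted wild type NLF masses.

    predicted: dict mapping a fragment name to its predicted mass (Da).
    measured:  list of observed peak masses (Da).
    Returns (missing, unexplained): wild type fragments with no matching
    peak, and observed peaks that match no wild type fragment.
    """
    missing = []
    unexplained = list(measured)
    for name, mass in predicted.items():
        match = next(
            (peak for peak in unexplained if abs(peak - mass) <= tolerance),
            None,
        )
        if match is None:
            missing.append(name)
        else:
            unexplained.remove(match)
    return missing, unexplained


if __name__ == "__main__":
    # Hypothetical wild type fragment masses and a hypothetical measurement.
    wild_type = {"NLF-1": 6120.0, "NLF-2": 9250.5, "NLF-3": 12480.2}
    peaks = [6120.8, 9259.4, 12479.5]
    print(compare_to_wild_type(wild_type, peaks))
```

In this made-up example the second fragment is reported as missing and a peak about 9 Da heavier is left unexplained, the pattern expected when one fragment carries a single base change.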

The present invention includes a number of nonrandom fragmentation
techniques for nonrandomly fragmenting a target nucleic acid.

In one embodiment, the nonrandom fragmentation technique comprises
hybridizing a single-stranded target nucleic acid to one or more sets of
fragmenting probes to form hybrid target nucleic acid/fragmenting probe
complexes comprising at least one double-stranded region and at least one
single-stranded region, and nonrandomly fragmenting said target nucleic acid by cleaving said
hybrid target nucleic acid/fragmenting probe complexes at every single-stranded
region with at least one single-strand-specific cleaving reagent to form a set of
NLFs. The set of fragmenting probes can leave single-stranded regions between
double-stranded regions formed by hybridization of said set of fragmenting probes
to said target nucleic acid. A single-stranded region comprises a portion of a
polynucleotide sequence as small as a single phosphodiester bridge, i.e. the
phosphodiester bond across from a nick, to 450 nucleotides in length.




The fragmenting probes are oligonucleotides that are complementary to a
nucleotide sequence of the target nucleic acid. A set of fragmenting probes can be
created such that the nucleotide sequences of the members of the set of
fragmenting probes represent the entire complement to the nucleotide sequence of
the target nucleic acid. For example, a set of fragmenting probes can provide
complete complementary sequence to the target nucleic acid. Alternatively, a set
of fragmenting probes, when hybridized to the target nucleic acid, can leave
single-stranded regions. Also, one or more sets of fragmenting probes can be
used such that the members of one set of fragmenting probes contain nucleotide
sequences that overlap with nucleotide sequences of members of a second set of
fragmenting probes. In yet another aspect, there are provided two sets of
fragmenting probes, where members of the second set of fragmenting probes
comprise at least one single-stranded nucleotide sequence complementary to
regions of said target nucleic acid that are not complementary to any nucleotide
sequences in any members of said first set of fragmenting probes.

Once the set(s) of fragmenting probes are hybridized to the target nucleic
acid, the single-stranded regions are cleaved using single-strand-specific cleaving
reagents, including enzymatic reagents as well as chemical reagents.
Single-strand-specific chemical cleaving reagents include hydroxylamine, hydrogen peroxide,
osmium tetroxide, and potassium permanganate.

Yet another nonrandom fragmentation technique comprises providing a
single-stranded target nucleic acid, hybridizing the single-stranded target nucleic
acid to one or more restriction site probes to form hybridized target nucleic acids
comprising double-stranded regions where said restriction site probes have
hybridized to said single-stranded target nucleic acid and at least one
single-stranded region, and nonrandomly fragmenting the hybridized target nucleic acids
using one or more restriction endonucleases that cleave at restriction sites within
the double-stranded regions. Another variation on this technique involves use of
universal restriction probes comprising two regions, the first region being
single-stranded and complementary to a specific site within the target nucleic acid, and
the second region being double-stranded and containing the restriction recognition
site for a particular Class IIS restriction endonuclease. Class IIS restriction
endonucleases cleave double-stranded DNA at a specific distance from their
recognition site sequence.

Another technique for nonrandom fragmentation comprises fragmenting the
target nucleic acid with one or more restriction endonucleases to form a set of
NLFs. This and the other forms of nonrandom fragmentation can be combined
with direct and indirect capture to a solid support to isolate single-stranded NLFs
for mass spectrometric analysis.
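
Restriction-based fragmentation yields fragments whose lengths, and therefore masses, are fixed by the positions of the recognition sites. The sketch below is a deliberately simplified illustration: it treats one strand as a text string, cuts at a fixed offset inside each recognition site, and estimates each fragment mass by summing the per-nucleotide masses quoted earlier in this document while neglecting end-group corrections. The example sequence is invented, and the EcoRI site (GAATTC, cut after the first G) is used only as a familiar example.

```python
# Per-nucleotide masses in Da, as quoted earlier in this document.
NUCLEOTIDE_MASS = {"A": 313.19, "T": 304.20, "G": 329.21, "C": 289.19}


def digest(sequence, site, cut_offset):
    """Cut one strand at cut_offset bases into every occurrence of site."""
    fragments = []
    start = 0
    position = sequence.find(site)
    while position != -1:
        cut = position + cut_offset
        fragments.append(sequence[start:cut])
        start = cut
        position = sequence.find(site, position + 1)
    fragments.append(sequence[start:])
    return [fragment for fragment in fragments if fragment]


def approximate_mass(fragment):
    """Sum of nucleotide masses; end-group terms are neglected (assumption)."""
    return sum(NUCLEOTIDE_MASS[base] for base in fragment)


if __name__ == "__main__":
    target = "ATGCGAATTCGGTACCGAATTCAT"  # hypothetical target strand
    for fragment in digest(target, "GAATTC", 1):
        print(fragment, len(fragment), round(approximate_mass(fragment), 2))
```

Because the cut positions depend only on the sequence, the same target always gives the same set of fragment lengths, which is what makes the fragments nonrandom.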

Another nonrandom fragmentation technique comprises providing
conditions permitting folding of said single-stranded target nucleic acid to form a
three-dimensional structure having intramolecular secondary and tertiary
interactions, and nonrandomly fragmenting said folded target nucleic acid with at
least one structure-specific endonuclease to form a set of single-stranded NLFs. A
set of nonrandom length fragments can comprise a nested set of NLFs, wherein
each member of the set has a 5' end of the target nucleic acid. The
structure-specific endonucleases useful for nonrandom fragmentation comprise any nucleases
that cleave at structural transitions within nucleic acids, including: Holliday
junctions, single-strand to double-strand transitions, or at the ends of hairpin
structures.

Another nonrandom fragmentation method comprises mutation-specific
cleavage by hybridizing a target nucleic acid to a set of one or more wild type
probes and specifically cleaving at any regions of nucleotide mismatch or base
mismatch that form between the target nucleic acid and a wild type probe. The
mutation-specific cleavage can be accomplished using a mutation-specific cleaving
reagent comprising structure-specific endonuclease or chemical reagents.

The nonrandom fragmentation methods described herein can be combined
to form different sets or subsets of nonrandom length fragments. For example,
the base mismatch nonrandom fragmentation method using wild type probes can be
used in concert with a set of nonrandom length fragments that have already been
created using any one of the other nonrandom fragmentation methods. These
nonrandom fragmentation methods can also be combined with isolation methods
designed to isolate specific sets of single-stranded nonrandom length fragments,
for example, only those NLFs derived from the + strand of the target nucleic
acid. The isolation methods include direct capture of the set of NLFs to a solid
support or indirect capture of a set of NLFs to a solid support via a capture probe
capable of binding to a solid support via covalent or noncovalent binding. The
fragmenting, wild type, restriction site, and universal restriction probes described
herein can also be used as capture probes for isolating a particular set of NLFs.

The isolation methods also comprise the use of a solution of volatile salts to
wash away undesired contaminants from the set of NLFs intended for mass
determination in the mass spectrometer. The volatile salts are useful for removing
background noise and can be easily removed by evaporation
prior to mass spectrometric analysis. Volatile salt solutions can be used in a
variety of different methods to prepare organic molecules such as nucleic acids and
polypeptides for mass spectrometric analysis. Thus, a method is described herein
of decreasing background noise, wherein the method comprises obtaining a sample
to be analyzed by a mass spectrometer, washing the sample with a solution of
volatile salts, and evaporating the solution of volatile salts from the sample.

The fragmentation and isolation methods separately or together can also be 
combined with the use of internal self-calibrants to improve the mass accuracy of
the mass spectrometric analysis. 

The above methods, separately or in combination, can also be combined 
with the use of mass-modified nucleotides and mass-modified nucleotide analogs 
incorporated in the target nucleic acid or a set of NLFs to improve mass resolution
between mass peaks. 

Kits for detecting mutations in one or more target nucleic acids in a sample
are also provided. In preferred embodiments, such kits comprise one or more
single-stranded target nucleic acids, one or more sets of oligonucleotide probes,
wherein each of said probes is complementary to a portion of said single-stranded
target nucleic acids, and various cleaving reagents, including single-strand specific
cleaving reagents, restriction endonucleases (both Class II and Class IIS), and
mutation-specific cleaving reagents. The oligonucleotide probes include
fragmenting probes, restriction site probes, and wild type probes. Such kits can
also contain a matrix, preferably 3-hydroxypicolinic acid. The kits may also
contain volatile salt buffers, and buffers providing conditions suitable for the
enzymatic or chemical reactions described above for nonrandomly fragmenting
target nucleic acids and isolating nonrandom length fragments in preparation for
mass spectrometric analysis. Additionally, the kits may contain solid supports for
purposes of isolating nonrandom length fragments.

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1A and 1B display examples of resolved nucleic acid fragments
(DNA) in the 20,000 to 30,000 Da range using MALDI-TOF mass spectrometry.
Both FIG. 1A and 1B are positive ion mass spectra obtained from 200 fmoles of
DNA in 3-HPA (3-hydroxypicolinic acid). Each spectrum is a sum of 100 laser
pulses at 266 nm. FIG. 1A: a single-stranded 72-mer which also shows a
71-mer. The FWHM resolution is 240, clearly resolving matrix adducts (labelled M).
FIG. 1B: 88-mer parent peak has a resolution of 330.

FIG. 2 is a diagram illustrating the basic steps for mass spectrometric
analysis of a nonrandomly-fragmented, double-stranded target nucleic acid.

FIG. 3 is a diagram illustrating the expected mass spectrum for a 
nonrandomly-fragmented double-stranded target nucleic acid that is a heterozygous
mix of wild type and mutant nucleic acid where the mutation is an A to T 
transversion. 

FIG. 4A and 4B illustrate the effect on mass resolution of a
mass-substituted base where a T has been replaced by heptynyldeoxyuridine during
amplification of the mutant region. FIG. 4A depicts a mass spectrum of a
heterozygous mix of wild type and mutant where A has mutated to T. Spectral
peaks are separated by 9 mass units. FIG. 4B depicts a mass spectrum of a
heterozygous mix of wild type and mutant where A has mutated to T. T has been
replaced by heptynyldeoxyuridine during amplification of the mutant region.
Spectral peaks are now separated by 65 mass units.

FIG. 5 is a diagram illustrating the effect of analyzing only positive strand
fragments from a heterozygous sample in reducing the number of total fragments
and simplifying the mass spectrum.

FIG. 6 is a diagram illustrating the use of restriction site probes to produce 
nonrandom fragments from single-stranded target nucleic acid. Note that in the
step of purifying nonrandom length fragments, the small cleaved probes will likely
be removed during purification.

FIG. 7A and B illustrate the use of fragmenting probes in conjunction with
single-strand-specific endonuclease to produce nonrandom fragments from
single-stranded target nucleic acid.

FIG. 8 is a diagram illustrating the use of fragmenting probes in
conjunction with single-strand-specific, base-specific chemical cleavage to produce 
nonrandom fragments from single-stranded target nucleic acid. 

FIG. 9A and B illustrate the use of fragmenting probes to produce
nonrandom fragments from heterozygous, single-stranded target nucleic acid, in
combination with the use of a mismatch-specific cleaving reagent to further fragment
the target nucleic acid at the site of a mutation.

FIG. 10 is a diagram illustrating a method of detecting a mutation using
mass spectrometric analysis of nonrandomly fragmented mutant and wild-type
double-stranded nucleic acids that have been denatured and reannealed and then
cleaved at any mismatch regions.

FIG. 11 is a diagram illustrating the effect of analyzing only positive
strand fragments from a heterozygous sample in reducing the number of total
fragments and simplifying the mass spectrum. In this case the positive strand has
been nonrandomly fragmented using both restriction endonuclease treatment and
mismatch-specific cleavage.

FIG. 12 is a diagram illustrating the use of structure-specific
endonucleases to nonrandomly fragment a folded, single-stranded target nucleic
acid.

FIG. 13A and B illustrate the use of a full length capture probe to isolate
and purify a set of single-stranded nonrandom length fragments. Shown in FIG.
13B as an option is a second step involving cleavage at mutation-specific
mismatch. This mismatch cleavage is particularly useful for cases where mutant
DNA is hybridized to wild type.

FIG. 14 is a mass spectrum of a set of nonrandom length fragments from a 
target nucleic acid containing a mutation, wherein the target nucleic acid is 
nonrandomly fragmented with hydroxylamine followed by piperidine, resulting in
mutation-specific cleavage at a mismatch. This mass spectrum illustrates the
presence of a nonrandom length fragment of 73 bases in size, that results from
mutation-specific cleavage.

FIG. 15 is a mass spectrum illustrating hydroxylamine fragmentation of a
wild type control of the mutation-containing target nucleic acid of FIG. 14. This
mass spectrum lacks a fragment of 73 bases in size due to the lack of a mutation
in the wild type target nucleic acid.

FIG. 16 is a mass spectrum of a mutation-containing target nucleic acid
that is specifically cleaved with potassium permanganate at the site of a base
mismatch.

FIG. 17 is a mass spectrum of a set of 5 single-stranded nonrandom length
fragments from an Mnl I digest of a wild type target nucleic acid of 184
nucleotides in length.

FIG. 18 is a magnified mass spectrum of two fragments, both 26 bases in
length, identical in nucleotide sequence except for a single G to A point mutation, 
illustrating clear resolution of the two mass peaks. 

DESCRIPTION OF SPECIFIC EMBODIMENTS 
The present invention, directed to methods of screening target nucleic acids 
to detect mutations using mass spectrometric techniques to analyze
post-amplification nucleic acids, provides the advantages of technical ease, speed, and
high sensitivity (minute samples are required). The methods described herein
yield a minimal set of products with improved mass resolution and accuracy and
detailed information about the nature and location of the
mutation in the target nucleic acid.

The present invention involves obtaining from a target nucleic acid, using a 
variety of nonrandom fragmentation techniques, a set of nonrandom length 
fragments (NLFs) and determining the mass of the members of the set of NLFs. 

The target nucleic acid can be single-stranded or double-stranded DNA,
RNA or hybrids thereof, from any source, preferably from a human source,
although any source which one is interested in screening for mutations can be used
in the methods described herein. When the target nucleic acid is RNA, the RNA
strand is the + strand. If desired, the target nucleic acid can be an RNA/DNA
hybrid, wherein either strand can be designated the + strand and the other, the -
strand. The target nucleic acid is generally a nucleic acid which must be screened
to determine whether it contains a mutation. The corresponding target nucleic acid
derived from a wild type source is referred to as a wild type target nucleic acid.
The target nucleic acids can be obtained from a source sample containing nucleic
acids and can be produced from the nucleic acid by PCR amplification or other
amplification technique. The target nucleic acids are typically too large to analyze
directly because current mass spectrometric methods do not have the mass
accuracy and resolution necessary to identify a single base change in molecules
larger than 100 base pairs. Accordingly, the target nucleic acids must be
fragmented.

Nonrandom length fragments are nucleic acids derived by nonrandom 
fragmentation of a target nucleic acid, and can comprise regions or nucleotide
sequences that are single-stranded or double-stranded. Due to the simpler mass
spectrum that results from mass analysis of single-stranded nonrandom length
fragments, it is preferred to determine the masses of sets of single-stranded
nonrandom length fragments. The nonrandom length fragments can also contain
mass-modified nucleotides, which can enhance ease of analysis, especially when a
point mutation has resulted in a very small mass change (on the order of 9 Da) in
a nonrandom length fragment as compared to the corresponding wild type
nonrandom length fragment. The methods described herein use mass spectrometry
to determine the masses of the set or sets of nonrandom length fragments to detect
mutations in a target nucleic acid. 

The nonrandom fragmentation techniques of the invention are any methods
of fragmenting nucleic acids that provide a defined set of nonrandom length
fragments, where that set of nonrandom length fragments may be reproducibly
obtained by using the same nonrandom fragmentation method on the same target
nucleic acid or its wild type version. The methods used for nonrandom
fragmentation are designed to optimize the ease of analyzing the resulting mass
spectral data by obtaining a range of fragment sizes that avoids significant overlap
of mass peaks. The nonrandom fragmentation techniques of the invention include
digestion with restriction endonucleases, structure-specific endonucleases, and
specific chemical cleavage. The enzymatic nonrandom fragmentation techniques
include partial digestion with restriction endonucleases or structure-specific
endonucleases. Partial cleavage occurs when not every possible cleavage site is
cleaved by the cleaving reagents used, whether enzymatic or chemical.

Fragmenting probes used in the invention are nucleic acids comprising a 
single-stranded nucleotide sequence or region that is complementary to a 
nucleotide sequence of a target nucleic acid. When fragmenting probes are also 
used as capture probes (i.e. to bind the fragmenting probe and any complementary 
nucleic acids hybridized thereto to a solid support), the fragmenting probes 
comprise a first binding moiety that is capable of binding to a second binding 
moiety attached to a solid support. Upon hybridization of a set of fragmenting 
probes and a target nucleic acid, the hybrid can be nonrandomly fragmented using 
one or more cleaving reagents that specifically cleave single-stranded regions.

Restriction site probes are oligonucleotides that when hybridized to
single-stranded target nucleic acid at specific complementary sequences form complete
double-stranded restriction endonuclease recognition sites cleavable using the
restriction endonuclease capable of cleaving at or near the recognition sites
formed.

Universal restriction probes comprise two regions, the first region being
single-stranded and complementary to a specific sequence within the target nucleic
acid, and the second region being double-stranded and containing the restriction
recognition site for a particular class IIS restriction endonuclease.

Capture probes used in the methods described herein comprise fragmenting
probes, restriction site probes, universal restriction probes, and any nucleic acids
that are bound to a solid support to isolate sets or subsets of nucleic acids or
NLFs. Capture probes can comprise a cleavable linkage or cleavable moiety that
can be selectively cleaved to release nucleic acids from a solid support prior to
mass spectrometric analysis.

Wild type probes are nucleic acids derived from a wild type nucleic acid 
sequence comprising at least one nucleotide sequence complementary to a 
nucleotide sequence of a target nucleic acid or a member of a set of NLFs. Wild
type probes can be restriction site probes, fragmenting probes, or capture probes
comprising a wild type nucleotide sequence that when hybridized to a
complementary mutation-containing region of a target nucleic acid results in a base
mismatch bulge or loop structure. Wild type refers to a standard or reference
nucleotide sequence to which variations are compared. As defined, any variation
from wild type is considered a mutation, including naturally occurring sequence
polymorphisms.

The term complementary refers to the formation of sufficient hydrogen 
bonding between two nucleic acids to stabilize a double-stranded nucleotide 
sequence formed by hybridization of the two nucleic acids. 

A single-stranded region comprises a portion of a nucleotide sequence that
is capable of being selectively cleaved by single-strand-specific cleaving reagents
or structure-specific endonucleases, wherein the portion of a nucleotide sequence
can range in size from a single phosphodiester bridge, i.e. the phosphodiester bond
across from a nick, to a nucleotide sequence ranging from one to 450 nucleotides
in length which are not hybridized to a complementary nucleotide sequence or
region.

The types of mass spectrometry used in the invention include ESI or 
MALDI, wherein the MALDI method may optionally include time-of-flight. The 
significant multiple charging of molecules in ESI and the fact that complex mixture
analysis is generally required mean that the ESI mass spectra will consist of a
great many spectral peaks, possibly overlapping and causing confusion. Because
the MALDI MS approach produces mass spectra with many fewer major peaks, 
this method is preferred. 

The methods described herein do not require sequencing of the target 
nucleic acid (using the sequencing methods that require four different base-specific 
chain termination reactions to determine the complete nucleotide sequence of a 
nucleic acid) in order to determine the nature and presence of a mutation within 
the target nucleic acid. 

For an initial mutation screen, a useful range of fragment sizes that will 
allow detection of a point mutation is around 10 to 100 bases. This size range is
where mass spectrometry presently has the necessary level of mass resolution and
accuracy. Thus, the fragmentation methods used in this invention are designed to
produce from the target nucleic acid, a set of nonrandom length fragments ranging
up to 100 bases in size. For purposes of this invention, fragmentation methods
that produce a set of random length fragments are not desirable due to the limited
reproducibility of such fragments, the limited information available from mass
spectrometry analysis of such fragments, and the likelihood of spectral overlap
from randomly generated fragments. For example, nonrandom fragmentation 
permits determination of the mass, base composition, and location of the set of 
NLFs relative to the target nucleic acid, whereas random fragmentation methods 
do not. 

Existing mass spectrometric instrumentation in the case of MALDI-TOF
MS optimally has a mass accuracy of about 1 part in 10,000 (0.01%), four times
what is necessary for detecting a single base change in a 50-base long
single-stranded DNA fragment. Utilization of mass-modified nucleotides (to be described
later) and nearby masses as internal calibrants provides optimal resolution and
mass accuracy of larger nucleic acids, and can extend the usable mutation
detection range up to 100 bases, if not higher. Continued advances in mass
spectrometric instrumentation will also push this range higher. 
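
By way of illustration only, the following sketch estimates the absolute mass uncertainty implied by a relative accuracy of 1 part in 10,000 for single-stranded DNA of a given length, alongside the roughly 9 Da shift of an A to T substitution. The average residue masses, the end correction, and the function names are assumptions introduced for this example and are not taken from this document.

```python
# Approximate average masses (Da) of DNA nucleotide residues within a chain.
# Assumed textbook values, not figures stated in this document.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
TERMINAL_CORRECTION = -61.96  # assumed correction for a 5'-OH / 3'-OH strand


def approx_fragment_mass(n_bases):
    """Approximate mass (Da) of an n-base ssDNA with equal base composition."""
    mean_residue = sum(RESIDUE_MASS.values()) / len(RESIDUE_MASS)
    return n_bases * mean_residue + TERMINAL_CORRECTION


def mass_uncertainty(n_bases, relative_accuracy=1e-4):
    """Absolute mass uncertainty (Da) at the given relative accuracy."""
    return approx_fragment_mass(n_bases) * relative_accuracy


# Mass shift caused by replacing a single A with a T (about 9 Da).
a_to_t_shift = RESIDUE_MASS["A"] - RESIDUE_MASS["T"]

for n in (50, 100):
    print(n, round(approx_fragment_mass(n)), round(mass_uncertainty(n), 2))
```

Under these assumptions the uncertainty is about 1.5 Da at 50 bases and about 3 Da at 100 bases, in both cases smaller than the A to T shift.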

Examples of the resolving capabilities of MALDI-TOF MS are displayed in 
FIG. 1A and 1B. FIG. 1 shows the positive ion TOF mass spectra obtained from
200 fmoles of DNA in the matrix 3-HPA. FIG. 1A (top figure) shows two
single-stranded PCR products of lengths 71 and 72 (mass difference = 305 Da =
Adenosine) as well as the 72mer and 72mer + a single matrix adduct (M) (mass
difference = 139 Da) to be well resolved (FWHM resolution = 240). FIG. 1B
(bottom figure) shows an 88 base length single-stranded product having a
resolution of 330. Both spectra display high enough accuracy and resolution to
detect a point mutation if one were present. 
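
The resolution figures above can be related to peak widths using the usual definition of FWHM resolution as mass divided by peak width at half maximum. The sketch below is illustrative only: the round 22,000 Da figure for the 72-mer is an assumption chosen within the 20,000 to 30,000 Da range stated for FIG. 1, and the criterion that two peaks are resolved when their spacing exceeds the FWHM width is a simplification.

```python
def fwhm_width(mass, resolution):
    """Peak width (Da) at half maximum, using resolution = m / delta_m."""
    return mass / resolution


def is_resolved(separation, mass, resolution):
    """Rough criterion (an assumption): resolved if spacing > FWHM width."""
    return separation > fwhm_width(mass, resolution)


NOMINAL_72MER_MASS = 22_000.0  # Da, assumed round figure for illustration

width = fwhm_width(NOMINAL_72MER_MASS, 240)
print(round(width, 1))                             # peak width in Da
print(is_resolved(139, NOMINAL_72MER_MASS, 240))   # 72-mer vs. matrix adduct
print(is_resolved(305, NOMINAL_72MER_MASS, 240))   # 72-mer vs. 71-mer
```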

These unique properties of mass spectrometry, MALDI-TOF MS in
particular, to separate nucleic acid fragments and identify their mass exactly and 
the methods taught herein provide novel methods for the screening of target 
nucleic acids and identification of changes in base composition that might result 
from genetic mutation. 




Improving Mass Accuracy By Internal Calibration and Internal 
Self-Calibration

Mass spectrometers are typically calibrated using analytes of known mass. 
A mass spectrometer can then analyze an analyte of unknown mass with an 
associated mass accuracy and precision. However, the calibration, and associated
mass accuracy and precision, for a given mass spectrometry system (including
MALDI-TOF MS) can be significantly improved if analytes of known mass are
contained within the sample containing the analyte(s) of unknown mass(es). The
inclusion of these known mass analytes within the sample is referred to as use of
internal calibrants. External calibrants, i.e. analytes of known mass that are not
mixed in with the set of nonrandom length fragments of unknown mass and
simultaneously analyzed in a mass spectrometer, are analyzed separately. External
calibrants can also be used to improve mass accuracy, but because they are not
analyzed simultaneously with the set of fragments of unknown mass, they will not
increase mass accuracy as much as internal calibrants do. Another disadvantage of
using external calibrants is that it requires an extra sample to be analyzed by the
mass spectrometer. For MALDI-TOF MS, generally only two calibrant molecules
are needed for complete calibration, although sometimes three or more calibrants
are used. All of the embodiments of the invention described herein can be 
performed with the use of internal calibrants to provide improved mass accuracy. 
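
Why two calibrants are generally sufficient can be illustrated with the commonly used linear time-of-flight relation t = a*sqrt(m/z) + t0, which contains two unknown constants. This relation, the instrument constants, and all names in the sketch below are assumptions made for illustration; this document does not specify the calibration equation.

```python
from math import sqrt


def fit_two_point(t1, mz1, t2, mz2):
    """Solve t = a*sqrt(m/z) + t0 for (a, t0) from two calibrant peaks."""
    a = (t2 - t1) / (sqrt(mz2) - sqrt(mz1))
    t0 = t1 - a * sqrt(mz1)
    return a, t0


def time_to_mz(t, a, t0):
    """Convert a flight time to m/z using fitted calibration constants."""
    return ((t - t0) / a) ** 2


# Hypothetical instrument constants, used only to synthesize flight times.
TRUE_A, TRUE_T0 = 0.8, 1.5


def flight_time(mz):
    return TRUE_A * sqrt(mz) + TRUE_T0


# Two calibrants of known mass fix both constants.
a, t0 = fit_two_point(flight_time(6000.0), 6000.0, flight_time(12000.0), 12000.0)

# An analyte of unknown mass is then converted from its flight time.
unknown_mz = time_to_mz(flight_time(9824.0), a, t0)
print(round(unknown_mz, 3))
```

Additional calibrants would over-determine the two constants and could instead be used in a least-squares fit.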

Using the methods described herein, one can obtain a mass spectrum with
numerous mass peaks corresponding to the set of nonrandom length fragments of
the gene or target nucleic acid under study. If no mutation is present in the target
nucleic acid, all of the mass peaks corresponding to the nonrandom length
fragments will be at mass-to-charge ratios associated with the set of NLFs from
the wild type target nucleic acid. However, if the target nucleic acid contains a
mutation, usually no more than one or two of the mass peaks will be shifted in
mass, leaving the majority of mass peaks at unaltered locations. In a preferred
embodiment of the invention, a self-calibration algorithm uses these unmutated or
nonpolymorphic NLFs for internal calibration to optimize the mass accuracy for
analysis of the NLFs containing a mutation, thus requiring no added calibrant(s),
simplifying the calibration, and avoiding potential spectral overlaps. In a given
sample, however, it will not be known a priori which mass peaks, if any, are
altered or shifted from their expected masses for the wild type NLFs.

The self-calibration algorithm begins by dividing up the observed mass 
peaks into subsets, each subset consisting of all but one or two of the observed 
mass peaks. Each data subset has a different one or two mass peaks deleted from 
consideration. For each subset, the algorithm divides the subset further into a first 
group of two or three masses which are then used to generate a new set of 
calibration constants, and a second group which will serve as an internal 
consistency check on those new constants. The internal consistency check begins 
by calculating the mass difference between the m/z values calculated for the 
second group of mass peaks and the values corresponding to reasonable choices 
for the associated wild-type NLFs. The internal consistency check can thus take 
the form of a chi-square minimization where the key parameter is this mass
difference. The algorithm finds which data subset has the lowest sum of the
squares of these mass differences resulting in a choice of optimized calibration 
constants associated with group one of this data subset. 

After new self-optimized calibration constants are obtained, the
mass-to-charge ratios are determined for the mass peaks omitted from the data subset;
these are the nonrandom length fragments suspected to contain a mutation. The
differences from the observed mass peaks for the wild type NLFs are then used to
determine whether a mutation has occurred, and if so, what the nature of this
mutation is (e.g. the exact type of deletion, insertion, or point mutation). This
self-calibration procedure should yield a mass accuracy of approximately 1 part in
10,000.
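
The self-calibration procedure described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the invention: it assumes a linear time-of-flight model t = a*sqrt(m) + t0, assumes each observed peak has already been provisionally assigned to a wild type NLF, omits a fixed number of peaks per subset (n_omit), and takes the lightest and heaviest retained peaks as the first (calibration) group. All function names and instrument constants are hypothetical.

```python
from itertools import combinations
from math import sqrt


def fit_two_point(t1, m1, t2, m2):
    """Assumed linear TOF model t = a*sqrt(m) + t0, solved from two peaks."""
    a = (t2 - t1) / (sqrt(m2) - sqrt(m1))
    return a, t1 - a * sqrt(m1)


def to_mass(t, a, t0):
    return ((t - t0) / a) ** 2


def self_calibrate(times, wild_type_masses, n_omit=1):
    """Return (omitted peak indices, their mass shifts from wild type).

    times[i] is the observed flight time of the peak provisionally assigned
    to the wild type NLF of mass wild_type_masses[i] (sorted by mass).
    """
    indices = range(len(times))
    best = None
    for omitted in combinations(indices, n_omit):
        kept = [i for i in indices if i not in omitted]
        if len(kept) < 3:
            continue  # need two calibration peaks plus at least one check peak
        # First group: lightest and heaviest kept peaks give the constants.
        lo, hi = kept[0], kept[-1]
        a, t0 = fit_two_point(times[lo], wild_type_masses[lo],
                              times[hi], wild_type_masses[hi])
        # Second group: remaining kept peaks act as the consistency check.
        score = sum((to_mass(times[i], a, t0) - wild_type_masses[i]) ** 2
                    for i in kept[1:-1])
        if best is None or score < best[0]:
            best = (score, omitted, a, t0)
    if best is None:
        raise ValueError("not enough peaks to self-calibrate")
    _, omitted, a, t0 = best
    shifts = {i: to_mass(times[i], a, t0) - wild_type_masses[i] for i in omitted}
    return omitted, shifts


# Synthetic example: five wild type NLF masses, with peak 2 shifted by an
# A to T change (about -9.01 Da). Instrument constants are hypothetical.
wild_type = [5000.0, 8000.0, 11000.0, 14000.0, 17000.0]
observed = list(wild_type)
observed[2] -= 9.01
times = [0.8 * sqrt(m) + 1.5 for m in observed]

omitted, shifts = self_calibrate(times, wild_type)
print(omitted, {i: round(s, 2) for i, s in shifts.items()})
```

In this synthetic case the subset that omits the shifted peak gives the lowest sum of squared differences, and the omitted peak's shift from wild type is then read off using the constants from that subset.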

Fragmentation of Target Nucleic Acids 

Fragmentation of a target nucleic acid is important for several reasons.
First, fragmentation allows direct analysis of large segments of a gene or other 
target nucleic acid using a single PCR amplification, eliminating the need to 
multiplex or run separately many smaller-segment PCR reactions. 

Second, sequencing of thousands of bases of a gene or other target nucleic 
acid, by mass spectrometry or otherwise, is a complex and expensive process. 




With current capabilities in MALDI and ESI, it is impractical to sequence nucleic 
acids greater than 50-100 bases in length. Consequently, in order to rapidly 
screen large genetic regions or target nucleic acids using mass spectrometric
nucleic acid sequencing, an impractical and cumbersome number of independent
sequencing reactions are necessary to cover the entire genetic region of interest.
Therefore, for screening large genetic regions or target nucleic acids for a wide
range of potential mutations using mass spectrometry, fragmentation of amplified
target nucleic acids ranging from 100 to 1000 base pairs (bp) facilitates faster
screening of larger target nucleic acids or genetic regions of interest. 

Sequencing can identify the exact location and nature of a genetic mutation 
in a target nucleic acid, but requires the use of many primers in many separate 
reactions. Mutations, especially for heterozygous samples analyzed using 
fluorescence-based systems, are often difficult to identify with confidence. Using
the fragmentation methods described herein, a heterozygous sample would yield
two distinct mass spectral peaks, correlating to the different masses of the mutant
and wild type nucleic acids. Accordingly, the methods described herein can be
used to detect a mutation in a target nucleic acid unambiguously. 

Third, mass spectrometric analysis of smaller nucleic acid fragments,
ranging in size from 2 to 300 bases, more preferably from 10 to 100 bases in
length, is desirable because the smaller nucleic acid fragments result in:

(a) more specific localization of any mutations than for larger sized nucleic
acid fragments,

(b) superior mass accuracy and resolution of nucleic acid fragments in this
mass range, and

(c) a multiplicity of mass peaks that can be used as internal self-calibration 
standards, further improving the mass accuracy. 

For analysis with MALDI-TOF MS, the goal of fragmentation is to
produce a set of nonrandom length fragments ranging in length from 2-300 bases,
preferably from 10-100 bases in length. The range of lengths serves to better
separate and resolve the fragment peaks in the resulting mass spectrum.

Fragmentation of target nucleic acids larger than 100 bases in length can be
accomplished using a number of means, including cleavage with one or more
DNA restriction endonucleases targeting specific sequences within double-stranded
DNA, chemical cleavage at structure-specific and/or base-specific locations,
polymerase incorporation of modified nucleotides that create cleavage sites when
incorporated, and targeted structure-specific and/or sequence-specific nuclease
treatment.

An exemplary case is where a larger target nucleic acid, e.g. 500 bases in
length, is nonrandomly fragmented to produce 10 to 30 nonrandom length
fragments that can all be individually resolved by MALDI-TOF mass
spectrometry. Two different nonrandom length fragments having the same number
of bases can still be resolved from each other by mass spectrometry when they
differ in base composition and consequently in mass. Gel electrophoresis methods
typically cannot resolve equivalent length fragments.

For example, for a 5 kilobase pair (kb) target nucleic acid to be fully
analyzed, using nonrandom length fragments with an average size of 30 bases,
approximately 170 nonrandom length fragments would need to be screened.
Typically, the target nucleic acid would be amplified by a number of DNA
amplifications, 10-20, in order to reduce the number of fragments to be
analyzed in any given sample. Each amplified target nucleic acid product would
be digested using restriction endonucleases, often with four-base recognition sites
to produce the optimal size fragments. It is preferable that the fragments vary in
size to simplify the mass spectral data, e.g. 32 bp + 28 bp + 27 bp + 37 bp +
..., although, as stated above, nonrandom length fragments of the same size could
potentially be analyzed if their base compositions vary enough to minimize spectral
overlap. 
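
The fragment counts in the preceding example follow from simple division, as the sketch below shows. It assumes, for illustration only, that the 10-20 amplification products are of equal length and do not overlap; the function name is hypothetical.

```python
from math import ceil


def fragments_needed(target_length_bases, mean_fragment_bases):
    """Number of fragments of the given mean size needed to span the target."""
    return ceil(target_length_bases / mean_fragment_bases)


# Whole 5 kb target at an average fragment size of 30 bases.
total = fragments_needed(5000, 30)
print(total)

# Fragments per sample when the target is split into 10 or 20 amplicons.
for n_amplicons in (10, 20):
    amplicon_length = 5000 // n_amplicons
    print(n_amplicons, amplicon_length, fragments_needed(amplicon_length, 30))
```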

A schematic of the process along with a hypothetical mass spectrum is
shown in FIG. 2. FIG. 2 illustrates a 161 base target nucleic acid that has been
PCR amplified and fragmented using restriction endonucleases. As a result, 6
nonrandom length fragments are produced. When the laser desorption process
occurs, during MALDI-TOF mass spectrometric analysis, the 6 double-stranded
fragments are mostly denatured and the resulting 12 single-stranded nonrandom
length fragments are ionized and detected. Shown at the bottom of FIG. 2 is a
simulated mass spectral data plot with all the mass peaks resolved. 




As can be seen in FIG. 2 it is very common that restriction endonuclease 
treatment will produce a number of complementary fragments with the same 
number of bases, e.g. two at 19 and two at 32. The presence of these
equal-length fragments places higher constraints on the required resolution for
distinguishing all of the different peaks. It is also not uncommon for the two 
equal-length, complementary fragments to have identical or nearly identical mass 
values, leaving the possibility that two complementary fragments will not be 
resolvable. 
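
The near-identical masses of complementary fragments can be illustrated with a short calculation. The sketch below uses assumed approximate average residue masses and an assumed end correction, neither of which is taken from this document, and the example sequences are hypothetical.

```python
# Approximate average residue masses (Da); assumed textbook values.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def strand_mass(seq):
    """Approximate ssDNA mass (Da), with an assumed -61.96 Da end correction."""
    return sum(RESIDUE_MASS[base] for base in seq) - 61.96


def strand_mass_gap(seq):
    """Mass difference (Da) between a strand and its complementary strand."""
    return strand_mass(seq) - strand_mass(reverse_complement(seq))


print(round(strand_mass_gap("AGTCAGTC"), 2))  # balanced A/T and G/C counts
print(round(strand_mass_gap("AAGGAAGG"), 2))  # purine-rich strand
```

Under these assumptions, a strand whose A count equals its T count and whose G count equals its C count has the same approximate mass as its complement, whereas a strand of skewed composition is well separated from its complement.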

Often samples will be heterozygous, containing a 50% mixture of both the
normal wild type nucleic acid and the mutated target nucleic acid. In the case
where the target nucleic acid carries a mutation in a heterozygous mix, one would
observe a splitting of peaks within the nonrandom length fragments containing the
mutation. An example of this splitting is shown in FIG. 3 where an A-T to T-A
transversion or base flip has occurred in one copy of the gene. The expected
peaks would be half normal height since their concentrations are halved relative to
homozygous concentrations. In this case, the difference between mutant and wild
type peaks would be ~9 Da which can be resolved in the 32 base long fragment.
The presence of wild type peaks provides internal self-calibrants allowing highly
accurate mass differences (as opposed to absolute mass) to be used to determine
the base composition change.
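
How a measured mass difference between wild type and mutant peaks might be mapped to a base composition change can be sketched as a simple lookup. The residue masses are assumed approximate average values, and the tolerance and function name are hypothetical; the shifts refer to a change on the single strand being measured.

```python
from itertools import permutations

# Approximate average residue masses (Da); assumed textbook values.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# Expected mass shift (mutant minus wild type) for each single-base change.
SUBSTITUTION_SHIFT = {
    f"{wild}>{mutant}": RESIDUE_MASS[mutant] - RESIDUE_MASS[wild]
    for wild, mutant in permutations("ACGT", 2)
}


def candidate_substitutions(observed_shift, tolerance=1.0):
    """Substitutions whose expected shift is within `tolerance` Da."""
    return sorted(
        name for name, shift in SUBSTITUTION_SHIFT.items()
        if abs(shift - observed_shift) <= tolerance
    )


print(candidate_substitutions(-9.0))
print(candidate_substitutions(15.5))
```

With the 1 Da tolerance assumed here, a shift of about -9 Da points to a single candidate, while a shift near 15.5 Da leaves two candidates (C to T and A to G) whose expected shifts differ by only about 1 Da.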

The methods described herein permit MALDI-TOF MS analysis of 
nonrandom length fragments which has a mass accuracy of approximately 1 part in
10,000. The use of internal self-calibrants makes it possible to extend this level of
accuracy up to and potentially beyond 30,000 Da or 100 bases. This mass
accuracy enables exact sizing of nucleic acid fragments and the determination of
the presence and nature of any mutation, including point mutations, insertions and
deletions, even in a heterozygous environment. Further described herein are
methods for improving the resolution of individual fragments by means including
elimination of equal-length complementary pairs through the use of
single-strand-targeted fragmentation and/or isolation procedures, and the incorporation of
mass-modified nucleotides to enhance the mass difference between similar sized
fragments and/or mutant and wild type fragments. In addition, these methods
provide for the removal of salts and other deleterious materials as well as a means
for the removal of unwanted nucleic acid fragments prior to mass spectroscopic
analysis.

Mass Resolution, Mass Accuracy, and the Use of Mass-Modified
Nucleotides

Any of the embodiments of the invention described herein optionally 
include nonrandom length fragments having one or more nucleotides replaced with 
mass-modified nucleotides, wherein said mass-modified nucleotides comprise
nucleotides or nucleotide analogs having modifications that change their mass
relative to the nucleotides that they replace. The mass-modified nucleotides
incorporated into the nonrandom length fragments of the invention must be
amenable to the enzymatic and nonenzymatic processes used for the production of
nonrandom length fragments. For example, the mass-modified nucleotides must
be able to be incorporated by DNA or RNA polymerase during amplification of
the target nucleic acid. Moreover, the mass-modified nucleotides must not inhibit
the processes used to produce nonrandom length fragments, including, inter alia,
specific cleavage by restriction endonucleases or structure-specific endonucleases
and digestion by single-strand specific endonucleases, whenever such steps are
used. Mass-modifications can also be incorporated in the nonrandom length
fragments of the invention after the enzymatic steps have been concluded. For
example, a number of small chemicals can react to modify specific bases, such as
kethoxal or formaldehyde.

Any or all of the nucleotides in the nonrandom length fragments can be 
mass-modified, if necessary, to increase the spread between their masses. It has
been shown that modifications at the C5 position in pyrimidines or the N7 position
in purines do not prevent their incorporation into growing nucleic acid chains by
DNA or RNA polymerase. [L. Lee et al. "DNA Sequencing with Dye-Labeled
Terminators and T7 DNA Polymerase: Effect of Dyes and dNTPs on
Incorporation of Dye-Terminators and Probability Analysis of Termination
Fragments" Nuc. Acids Res. 20, 2471 (1992)] For example, an octynyl moiety
can be used in place of methyl on thymidine to alter the mass by 94 Da. 
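
The 94 Da figure can be checked from elemental composition. The sketch below assumes the octynyl substituent has the formula C8H13 and uses standard average atomic masses; the formulas and names are assumptions introduced for illustration.

```python
ATOMIC_MASS = {"C": 12.011, "H": 1.008}  # standard average atomic masses (Da)


def group_mass(composition):
    """Mass (Da) of a substituent given its element counts."""
    return sum(ATOMIC_MASS[element] * n for element, n in composition.items())


METHYL = {"C": 1, "H": 3}     # CH3
OCTYNYL = {"C": 8, "H": 13}   # C8H13, assumed formula for an octynyl group

# Net mass change when the methyl group is replaced by the octynyl group.
shift = group_mass(OCTYNYL) - group_mass(METHYL)
print(round(shift, 1))
```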




Mass-modifying groups can be, for example, halogen, alkyl, ester or
polyester, ether or polyether, or of the general type XR, wherein X is a linking
group and R is a mass-modifying group. The mass-modifying group can be used
to introduce defined mass increments into the nonrandom length fragments. One
of skill in the art will recognize that there are numerous possibilities for
mass-modifications useful in modifying nucleic acid fragments or oligonucleotides,
including those described in Oligonucleotides and Analogues: A Practical
Approach, Eckstein ed. (Oxford 1991) and in PCT/US94/00193, which are both
incorporated herein by reference.

At larger mass ranges (30,000-90,000 Da), the mass resolution and mass
accuracy of current MALDI-TOF mass spectrometers will not be sufficient to
identify a single base change. For this reason, it may be preferable to increase the
useful mass range artificially by substituting standard nucleotides within either a
target nucleic acid or a nonrandom length fragment with mass-modified nucleotides
having significantly larger mass differentials. Use of mass-modified nucleotides
applies as well to the mass range below 30,000 Da. Mass modification can
generally increase the quality of the mass spectra by enlarging the mass differences
between NLFs of similar size and composition. For example, mass-modified
nucleotides can increase the minimum mass difference between two nonrandom
length fragments that are identical in base composition except for a single base
which is an A in one NLF and is a T in the other. Normally, these two NLFs will
differ in mass by only 9 Da. By incorporating a single mass-modified nucleotide
in place of one of the bases, the mass difference can be >20 Da. The spectra in
FIG. 4 depict the influence mass-modified nucleotides can have on fragment
resolution. One example of the many possible mass modifications useful in this
invention is the use of 5-(2-heptynyl)-deoxyuridine in place of thymidine. The
replacement of a methyl group by heptynyl changes the mass of this particular
nucleotide by 65 Da. An A to T transversion in a nucleic acid fragment in which
all thymidine bases have been replaced with 5-(2-heptynyl)-deoxyuridine would
produce a peak shift of 56 Da as opposed to 9 Da for the same nucleic acid
fragments without the mass-modified nucleotides. The use of mass-modified
nucleotides is especially important in the analysis of NLFs derived from RNA.
Normally, the masses of C and U vary by only 1 Da, making it practically
impossible to detect C to U or U to C point mutations within a given fragment.
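
The arithmetic in this paragraph can be checked with a short sketch. It assumes
rounded average masses for DNA residues within a chain (standard reference
values, not taken from this document), takes the 65 Da heptynyl shift quoted
above as given, and ignores terminal groups because they cancel when two
fragments of equal length are compared. The function names and the example
sequences are hypothetical.

```python
# Rounded average masses (Da) of DNA nucleotide residues within a chain.
# Assumption: standard reference values, used here only for illustration.
DNA_RESIDUE = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

# Mass added when 5-(2-heptynyl)-deoxyuridine replaces thymidine,
# using the 65 Da figure quoted in the text.
HEPTYNYL_SHIFT = 65.0


def residue_sum(seq, t_shift=0.0):
    """Sum of residue masses; every T is made heavier by t_shift."""
    return sum(DNA_RESIDUE[b] + (t_shift if b == "T" else 0.0) for b in seq)


def mass_difference(wild, mutant, t_shift=0.0):
    """Absolute mass difference between two fragments of equal length."""
    return abs(residue_sum(mutant, t_shift) - residue_sum(wild, t_shift))


def required_resolving_power(mass, delta):
    """m / delta-m needed to separate two peaks that lie delta apart near mass."""
    return mass / delta


wild = "GATCAGTC"    # hypothetical wild type fragment
mutant = "GATCTGTC"  # same fragment with an A to T transversion at base 5

plain = mass_difference(wild, mutant)
modified = mass_difference(wild, mutant, HEPTYNYL_SHIFT)
print(f"unmodified: {plain:.2f} Da, mass-modified: {modified:.2f} Da")
print(f"resolving power needed at 30,000 Da: "
      f"{required_resolving_power(30000, plain):.0f} vs "
      f"{required_resolving_power(30000, modified):.0f}")
```

Under these assumptions the unmodified pair differs by about 9 Da and the
mass-modified pair by about 56 Da, so the resolving power needed to separate
the two peaks near 30,000 Da drops by roughly a factor of six.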

Benefits of Analyzing Single-Stranded Nucleic Acids

The goal of this invention is the accurate determination of the masses of a 
set of resolved nonrandom length fragments and correlation of this data to the
characterization of any mutation, if present. The embodiments of this invention
include mass spectrometric determination of masses of the members of a set of
single-stranded nonrandom length fragments as well as mass determination of the
members of a set of mass-modified, double-stranded nonrandom length fragments.
The preferred embodiment is a method of detecting mutations in a target nucleic
acid comprising obtaining a set of nonrandom length fragments in single-stranded
form, wherein the single-stranded nonrandom length fragments are derived from
one of either the positive or the negative strand of the target nucleic acid or where
the set is a subset of fragments derived from both the positive and the negative
strands of the target nucleic acid. The examples of single-stranded methods
described herein
focus on fragments derived from the positive strand. 

FIGS. 2 and 3 illustrate that each double-stranded nonrandom length
fragment, comprising two complementary strands, produces two peaks in the mass
spectrum corresponding to the denatured single strands. The additional peaks
from double-stranded nonrandom length fragments as compared to single-stranded
nonrandom length fragments add to congestion of mass peaks in the mass spectra,
as well as introducing the possibility that it may be extremely difficult, if not
impossible, to resolve the complementary fragments if they have nearly or exactly
identical base compositions. Furthermore, some portion of the double-stranded
nonrandom length fragments do not fully denature, and mass peaks corresponding
to the double-stranded products increase the spectral congestion.

Because spectra using both strands contain a two-fold redundancy in data, 
since any mutation in one strand will be present within its complement, it is 
reasonable to remove one strand prior to mass spectrometric analysis and still 
produce all of the data necessary for complete mutation analysis. For these 
reasons, it is the preferred embodiment to analyze a set of single strands where
only one of the two complementary sets of nucleic acid fragments representing the
full target sequence is used.

FIG. 5 shows the expected spectrum if only the nonrandomly fragmented
positive strand of a target nucleic acid from FIG. 3 is analyzed by mass
spectrometry. Analysis of one of the two complementary strands of the double-
stranded nonrandom length fragments halves the number of expected peaks within
the mass spectra, allowing more total fragments to be resolved and making it
possible to analyze longer target nucleic acids at one time. Removal
of one of the two strands from each nonrandom length fragment eliminates the
greatest source of complication for each spectrum. A number of methods for
isolating and preparing both single-stranded and double-stranded nonrandom length
fragments for mass spectrometry are described herein.

Methods of Nonrandom Fragmentation of Target Nucleic Acids

The methods of the invention all involve obtaining from a target nucleic
acid a set of resolvable, nonrandom-length fragments and determining the mass of
the members of that set using mass spectrometry without sequencing the target
nucleic acid. All of the methods described herein involving mass spectrometry
include inter alia two types of mass spectrometry, electrospray ionization (ESI)
and matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF). In
addition to the restriction endonuclease approach to nonrandomly fragmenting a
target nucleic acid, there are a number of other approaches which are described
below.

Nonrandom Fragmentation Using Restriction Site Probes

Target nucleic acid can be nonrandomly fragmented using hybridization to
nucleic acid restriction site probes followed by cleavage with one or more
restriction endonucleases, the recognition sequences of which are contained in the
restriction site probes used. "Restriction site probes" are oligonucleotides that
when hybridized to single-stranded target nucleic acid at specific sequences form a
complete double-stranded recognition site cleavable using restriction
endonucleases. The use of restriction site probes is illustrated in FIG. 6.



The sequence of a wild type target nucleic acid can be analyzed to 
determine which restriction sites would result in an ideal spread of members of a
set of NLFs. The restriction site probes are then made using well-known synthetic
techniques. The restriction site probes can range from 6 - 100 nucleotides in
length, preferably from 10-30 nucleotides in length. One advantage of using very
short restriction site probes is that after cleavage with the selected restriction
endonucleases, the mass of the members of the set of NLFs having cleaved
restriction site probes attached can be directly determined in the mass spectrometer
without requiring an isolating step to remove the cleaved restriction site probes.
On the other hand, if the cleaved restriction site probes are intended to be used
also as capture probes, then the restriction site probes must either have a first
binding moiety that is capable of binding to a second binding moiety attached to a
solid support or the restriction site probes must have at least one additional
nucleotide sequence that is complementary to another probe that is bound to a
solid support. A "capture probe" is an oligonucleotide that comprises a portion
capable of hybridizing to a nucleic acid, such as a target nucleic acid or a
nonrandom length fragment, and a binding moiety that binds the capture probe to a
solid phase, either through covalent binding or affinity binding, or a mixture
thereof. A capture probe can itself bind to a solid support via binding moieties
(direct capture) or can bind to a solid support via another capture probe that binds
to a solid support (indirect capture). Also, when the restriction site probe is also
used as a capture probe, the preferred range is from 30-50 nucleotides in length,
to stabilize the hybridization of the capture probe. By using larger restriction site
probes complementary to singular locations on the target nucleic acid it is possible
to prevent a restriction enzyme from cutting at all possible locations in a target
nucleic acid where restriction sites for a particular restriction endonuclease appear,
e.g. cutting at only 5 or 10 restriction sites within a single-stranded target. This is
another tool that can be used to produce the optimal nonrandom length fragment
set or subset. 
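
The sequence analysis described at the start of this paragraph can be sketched
as a simple in-silico digest. The three enzymes below (EcoRI, HaeIII, AluI) and
their cut positions are well-known examples chosen for illustration; they are
not named in this document, and the target sequence and function names are
hypothetical.

```python
import re

# Recognition sequence and cut offset within the site for three common
# enzymes: EcoRI G^AATTC, HaeIII GG^CC, AluI AG^CT (illustrative choices).
ENZYMES = {"EcoRI": ("GAATTC", 1), "HaeIII": ("GGCC", 2), "AluI": ("AGCT", 2)}


def cut_positions(seq, names):
    """Sorted cut positions on the target strand for the chosen enzymes."""
    cuts = set()
    for name in names:
        site, offset = ENZYMES[name]
        # A lookahead pattern also finds overlapping occurrences of a site.
        for match in re.finditer(f"(?={site})", seq):
            cuts.add(match.start() + offset)
    return sorted(cuts)


def fragment_lengths(seq, names):
    """Lengths of the fragments produced by a complete digest."""
    bounds = [0] + cut_positions(seq, names) + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Hypothetical 26-base target carrying one site for each enzyme.
target = "AAA" + "GGCC" + "TTTT" + "GAATTC" + "AAA" + "AGCT" + "TT"

for combo in (["HaeIII"], ["HaeIII", "EcoRI"], ["HaeIII", "EcoRI", "AluI"]):
    print(combo, fragment_lengths(target, combo))
```

Comparing the fragment lengths for different enzyme combinations is one way
to judge which set of restriction site probes gives the most even spread of
NLF sizes before any probes are synthesized.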

An alternative form of restriction site probe is the universal restriction
probe as described by Szybalski. [W. Szybalski "Universal Restriction
Endonucleases: Designing Novel Cleavage Specificities by Combining Adapter
Oligodeoxynucleotide and Enzyme Moieties" Gene 40, 169 (1985) (incorporated
by reference herein)] These universal restriction probes comprise two regions, the
first region being single-stranded and complementary to a specific sequence within
the target nucleic acid, and the second region being double-stranded and containing
the restriction recognition site for a particular class IIS restriction endonuclease.
Class IIS restriction endonucleases cleave double-stranded DNA at a specific
distance from their recognition sequence. By using this property, and the
universal restriction site probe design, it is possible to nonrandomly fragment a
single-stranded DNA target at virtually any sequence, providing the means to
better control the selection of fragment sizes. It is also possible to mix standard
restriction site probes and universal restriction probes in a single reaction. 
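
The "specific distance" property of class IIS enzymes can be illustrated with
a minimal sketch. It assumes FokI as the example enzyme (recognition sequence
GGATG, cutting 9 bases downstream on the recognition strand and 13 bases
downstream on the opposite strand); FokI is not named in this section, and the
function name and test sequence are hypothetical.

```python
def iis_cut_sites(seq, site="GGATG", top_offset=9, bottom_offset=13):
    """Cut positions for each forward-orientation occurrence of a class IIS site.

    Returns (top_cut, bottom_cut) pairs in top-strand coordinates, counted as
    the number of bases lying 5' of the cut. Defaults assume FokI.
    """
    cuts = []
    start = seq.find(site)
    while start != -1:
        end = start + len(site)  # first base after the recognition site
        cuts.append((end + top_offset, end + bottom_offset))
        start = seq.find(site, start + 1)
    return cuts


# Hypothetical duplex region: two flanking bases, the site, then 20 bases.
duplex = "TT" + "GGATG" + "A" * 20
print(iis_cut_sites(duplex))
```

Because the cut falls a fixed number of bases away from the recognition site,
shifting where the probe's single-stranded region hybridizes shifts the cut on
the target by the same amount, which is what lets the cleavage point be placed
at almost any chosen sequence.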

In this approach, a positive single-stranded target nucleic acid is hybridized
to one or more restriction site probes that are complementary to one or more
restriction endonuclease recognition sequences within the target nucleic acid. Upon
hybridization of the restriction site probes to the target nucleic acid, hybridized
target nucleic acids are formed, comprising double-stranded regions where the
restriction site probes have hybridized to the target nucleic acid and at least one
single-stranded region where the target nucleic acid remains unhybridized to a
restriction site probe. The double-stranded regions of the hybridized target nucleic
acids are recognition sites for cleavage by one, two or more restriction
endonucleases. After the formation of hybridized target nucleic acids, the
hybridized target nucleic acids are digested with one, two or more restriction
endonucleases, the recognition sequences of which are contained within the 
double-stranded regions. 

The resulting nonrandom length fragments have at least one cleaved
restriction site oligonucleotide probe annealed. In some cases, these cleaved
probes will be of a size too small to remain hybridized to the target fragments.
These nonrandom length fragments can either be purified with the cleaved
restriction site oligonucleotide probes attached, or the NLFs can be purified from
the cleaved oligonucleotide restriction site probes. Both types of purification can
be accomplished using a variety of techniques known in the art, including
filtration, precipitation, or dialysis. The preferred approach is to capture the
NLFs to a solid support. The set of nonrandom length fragments can be directly
captured to a solid support themselves using a number of means including a
binding moiety such as biotin incorporated at numerous base positions throughout
the NLFs. Or the NLFs can be indirectly captured to a solid support via
hybridization to one or more capture probes that are themselves bound to a solid
support. The capture probe can comprise the full-length strand of the target
nucleic acid that is complementary to the strand from which the nonrandom length
fragments were derived. Alternatively, the capture probes can be a set of capture
probes
each containing at least one sequence complementary to said nonrandom length 
fragments. 

By combining an asymmetric amplification method to produce single- 
stranded target nucleic acids with the use of restriction site probes, as described 
herein, one can produce predominantly the desired set of single-stranded NLFs.
The restriction site probes used to produce the recognition sites may copurify with
the NLFs but can be designed so that they do not interfere with the majority of the
mass spectra. For example, the restriction site probes can be designed so that
after cleavage their final sizes are less than 20 bases in length and the nonrandom 
length fragments can have sizes in the range of 20 to 100 bases. 

The methods described above can also be modified with the use of 
uncleavable restriction probes. These uncleavable probes, synthesized with a
restriction endonuclease-resistant backbone such as phosphorothioate,
boranophosphate, or methyl phosphonate, can be used to keep the target nucleic
acid NLFs tethered together following restriction digest and can provide a different
approach to purification of the NLFs.

Fragmentation Using Fragmenting Probes and Single-Strand-Specific Cleavage

While the use of restriction endonucleases in various combinations and in 
multiple digests can be an effective approach to fragmentation of the target nucleic 
acid, when a target presents long sequence lengths (>100 bases) that do not
contain any restriction sites, alternative nonrandom fragmentation techniques are
preferred. Long (>100 base) fragments will be difficult to probe with sufficient
mass accuracy to determine if a base change mutation has occurred. One way to
control the size of fragments is through the use of fragmenting probes and single-
strand-specific endonucleases.

Fragmenting probes are defined as nonrandom length, single-stranded 
oligonucleotides complementary to selected regions of a single-stranded target
nucleic acid, and are used through hybridization to define and differentiate within
the target nucleic acid regions that are double-stranded versus regions that remain
single-stranded. Following differentiation by hybridization the single-stranded
regions are subjected to cleavage. As is the case for all of the methods described
here that utilize oligonucleotides, the fragmenting probes may be comprised of
DNA, RNA or modified forms of nucleic acid such as phosphorothioates, methyl
phosphonates or peptide nucleic acids. Three examples of single-strand-specific
nucleases that can be used in these methods are Mung bean nuclease, Nuclease S1,
and RNase A. These enzymes cut single-stranded DNA or RNA exclusively and
act as both exo- and endonucleases. 

An example of how these probes and enzymes are used follows. A set of 
fragmenting probes of defined size and sequence is designed to hybridize to
complementary regions of the target nucleic acid. It is preferable that the target
nucleic acid be primarily if not entirely single-stranded. Use of a T7 or SP6 RNA
polymerase transcription system for final amplification is a simple approach to
producing the required single-stranded target nucleic acid. Asymmetric PCR can 
also be utilized to produce primarily single-stranded target. 

FIG. 7 shows how different portions of the single-stranded target nucleic 
acid are hybridized to the oligonucleotide probes. Following hybridization, any
regions of the target nucleic acid that remain single-stranded are cleaved using a
single-strand-specific endo/exonuclease, such as S1 nuclease, Mung bean
nuclease, or RNase A. The size of the single-stranded region can be as small as a
single phosphodiester bridge, i.e. the phosphodiester bond across from a nick. S1
nuclease is capable of cleaving across from nicks. The end products are double-
stranded hybrids comprised of two equal length strands: one strand is a member of
the set of nonrandom length fragments derived from the target nucleic acid and the
other strand is a member of the set of fragmenting probes, wherein said NLFs are
hybridized to said fragmenting probes. Either these double-stranded hybrids or
isolated single-stranded nonrandom length fragments derived from said target
nucleic acid can be used for MALDI-TOF mass spectrometric analysis.
Analysis of the single-stranded nonrandom length fragments derived from said
target nucleic acid is preferred because it provides a simpler mass spectrum. It
should be noted that when the complementary strands are a mixed DNA/RNA
hybrid there will be a significant mass difference between the two strands in all
cases, making
each strand more easily resolvable in the mass spectrum. 
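
The protection step described above can be sketched as follows. The sketch
assumes perfect, unique hybridization of each probe, complete digestion of
every unhybridized base, and probes that do not overlap within a single
reaction. The target sequence and function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def protected_fragments(target, probes):
    """Return (start, end, sequence) for each target region covered by a probe.

    Regions not covered by any probe are treated as digested by the
    single-strand-specific nuclease and are not returned.
    """
    fragments = []
    for probe in probes:
        site = revcomp(probe)       # target sequence the probe anneals to
        start = target.find(site)   # assumes the site occurs only once
        if start != -1:
            fragments.append((start, start + len(site), site))
    return sorted(fragments)


# Hypothetical 30-base single-stranded target and two fragmenting probes
# covering bases 0-11 and 15-29; bases 12-14 are left single-stranded.
target = "ATGGCCATTGTAATGGGCCGCTGAAAGGGT"
probes = [revcomp(target[0:12]), revcomp(target[15:30])]

for start, end, seq in protected_fragments(target, probes):
    print(start, end, len(seq), seq)
```

Each returned fragment has the same length as the probe that protected it,
which mirrors the statement that the end products are hybrids of two strands
of equal length.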

Unlike the restriction endonuclease nonrandom fragmentation approach, 
with this method it is possible to use a DNA/RNA hybrid, providing a convenient
route toward digesting the fragmenting probes after nonrandomly fragmenting the
target nucleic acid. Isolation of the set of NLFs from the set of fragmenting probes
is another means to simplify the mass spectra. Because of the different chemical
nature of the two strands of the hybrid, it is possible to utilize DNA- or RNA-
specific enzymes to digest the fragmenting probes. As an example, DNase can be
used to digest fragmenting probes comprised of DNA while leaving nonrandom
length RNA fragments intact or RNase can be used to digest RNA probes while
leaving nonrandom length DNA fragments intact. It is also possible to utilize
different chemistries to specifically digest one strand or the other. These
chemistries include the use of acid to digest DNA or base to digest RNA as well
as a multiplicity of other chemistries that can be used to cut modified versions of
DNA or RNA. This differential cutting can be exploited to purify and analyze
only one of the two strands as described in a later section.

Thus, another embodiment of this invention is a method of detecting a 
mutation in a DNA fragment from a DNA/RNA hybrid nucleic acid comprising 
obtaining a DNA/RNA hybrid wherein the DNA/RNA hybrid comprises a single-
strand of a DNA fragment hybridized to a single-strand of an RNA fragment,
digesting the single-strand of RNA using an RNA-specific reagent, including RNase
or a base, determining the mass of the single-stranded DNA fragment using mass
spectrometry, and comparing said mass to a mass of a wild type single-stranded
DNA fragment. Another embodiment is a method of detecting a mutation in an
RNA fragment from a DNA/RNA hybrid nucleic acid comprising obtaining a
DNA/RNA hybrid wherein the DNA/RNA hybrid comprises a single-strand of a
DNA fragment hybridized to a single-strand of an RNA fragment, digesting the
single-strand of DNA using a DNA-specific reagent, including DNase or an acid,
determining the mass of the single-stranded RNA fragment using mass
spectrometry, and comparing said mass to a mass of a wild type single-stranded
RNA fragment. These embodiments can also be applied to a set of DNA/RNA
hybrids, using the DNA-specific or RNA-specific digestion to leave a set of
nonrandom length fragments consisting of DNA fragments or a set of nonrandom
length fragments consisting of RNA fragments.

Complete digestion using restriction endonucleases produces a series of 
fragments that can be aligned end to end but do not overlap. With the use of 
fragmenting probes and single-strand-specific cleaving reagents described herein,
one can design a set of sequence and size specific fragmenting probes that can be
used to produce a set of nonrandom length fragments such that one or more
members of the set comprise a nonoverlapping nucleotide sequence and a
nucleotide sequence that overlaps with a nucleotide sequence of another member of
the set. The example shown in FIG. 7 uses a set of sequence and size specific
fragmenting probes that overlap (e.g. split into two sets of hybridization reactions)
to produce an overlapping set of nonrandom length fragments. The set of
nonrandom length fragments that overlap could be nested. By using a set of
overlapping nonrandom length fragments to screen for a mutation, one can more
narrowly localize the region containing a mutation. If two overlapping
nonrandom length fragments both contain the mutation, as is the case in FIG. 7, it
is then known that the mutation exists within the small region of overlap.
Conversely, if only one of the overlapping fragments contains a mutation, it is
known that the mutation cannot be in an overlapping region. This approach plus
the ability to design certain fragmenting probes to be very small in size, e.g. 10 to
20 bases (typical fragmenting probes will be anywhere between 10 and 100 bases
in length), allows one to probe genetic regions that are known hot spots for
mutation with greater detail. 
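
The localization logic for overlapping fragments can be written out as a
small set calculation. The sketch assumes a single mutation and that every
fragment covering it shows a detectable mass shift. The fragment names,
coordinates, and function name are hypothetical.

```python
def localize(fragments, shifted):
    """Candidate positions for a single mutation.

    fragments: dict mapping a fragment name to its (start, end) span on the
               target, with end exclusive.
    shifted:   set of fragment names whose measured mass differs from the
               wild type value.
    The mutation must lie in every shifted fragment and in no unshifted one.
    """
    if not shifted:
        return []
    candidate = set.intersection(
        *(set(range(*fragments[name])) for name in shifted)
    )
    for name, (start, end) in fragments.items():
        if name not in shifted:
            candidate -= set(range(start, end))
    return sorted(candidate)


# Three hypothetical overlapping fragments on a 70-base target.
fragments = {"A": (0, 30), "B": (20, 50), "C": (40, 70)}

both = localize(fragments, {"A", "B"})   # shift seen in A and B
only_b = localize(fragments, {"B"})      # shift seen in B only
print(both[0], both[-1])
print(only_b[0], only_b[-1])
```

When A and B both shift, the candidates collapse to their region of overlap
(positions 20 to 29). When only B shifts, the overlaps with A and C are
excluded and only the part of B unique to it remains (positions 30 to 39).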

One variant of this method is to use single-strand-specific chemical reagents 
as a means for cleaving a target nucleic acid into a set of nonrandom length
fragments. Several base-specific cleavage chemistries have been identified that
cleave the nucleic acid backbone at base-specific sites that are single-stranded and,
under optimal conditions, demonstrate zero or extremely reduced cleavage levels
at base-specific sites that are double-stranded. As an option the target nucleic acid
can be synthesized using one or more modified nucleotides in order to make the
backbone more vulnerable to chemical cleavage. By using fragmenting probes to
hybridize to a target nucleic acid at all sites except the specific locations where
cleavage is desired, it is possible to limit cleavage to these single-stranded sites
and create a sequence-specific set of nonrandom length fragments. The method,
schematized in FIG. 8, can utilize one of a number of different chemistries that
are known to be single-strand specific including hydrogen peroxide cleavage
and/or 2-hydroperoxytetrahydrofuran cleavage at C. [P. Richterich et al.
"Cytosine specific DNA sequencing with hydrogen peroxide" Nuc. Acids Res. 22,
4922 (1995); G. Liang, P. Gannet & B. Gold "The Use of 2-
Hydroperoxytetrahydrofuran as a Reagent to Sequence Cytosine and to Probe Non-
Watson-Crick DNA Structures" Nuc. Acids Res. 21, 713 (1995)]. Target nucleic
acids that contain cleavage-modified nucleotides can be made by incorporation of 
modified nucleotide triphosphates during an amplification or polymerization step. 
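
A minimal sketch of this probe-masking idea, taking cleavage at C as the
base-specific chemistry: positions covered by fragmenting probes are treated
as fully protected, and every uncovered C is treated as a cleavage site. Real
chemistries are less clean than this, and the sequence, probe spans, and
function name are hypothetical.

```python
def unprotected_c_sites(target, protected):
    """Indices of C residues that are not covered by any fragmenting probe.

    protected: list of (start, end) spans on the target, end exclusive,
               where fragmenting probes are hybridized.
    """
    covered = set()
    for start, end in protected:
        covered.update(range(start, end))
    return [i for i, base in enumerate(target)
            if base == "C" and i not in covered]


# Hypothetical 11-base target with probes over bases 0-3 and 7-10.
target = "AACGTCCATGC"
probe_spans = [(0, 4), (7, 11)]
print(unprotected_c_sites(target, probe_spans))
```

Of the four C residues in this example, only the two in the uncovered window
remain candidates for cleavage; the C residues under the probes are masked.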

A second variant of this method is to create heterozygous hybrids between
the wild type fragmenting probes and the target nucleic acid. By using
fragmenting probes comprised of wild type sequence, any hybrids that form with
mutant sequence containing a point mutation will create a base mismatch or bulge.
If the mutation is a small insertion or deletion, a looped out sequence will occur.
With this heterozygous hybrid, it is possible to use one of the structure-specific
enzymes or chemistries described in the following section to create a mutation-
specific cleavage at the site of a mutation. An example of the pattern of
nonrandom length fragments produced is shown in FIG. 9. This approach
permits determination of the type and location of the mutation that has occurred.
Also as will be described, performance of a mutation-specific cleavage relaxes the
mass accuracy and resolution constraints, thus increasing the useful size range for
the nonrandom length fragments to be analyzed with MALDI-TOF mass 
spectrometry to a range of several hundred bases. 



Mutation-Specific Cleavage Using Structure-Specific Endonucleases

Another nonrandom fragmentation technique involves the use of mutation-
specific cleavage at base mismatch regions, if present, using structure-specific
endonucleases or single-strand-specific cleavage. Creation of mismatch regions
requires hybridization between a mutation-containing, single-stranded target
nucleic acid and a set of one or more single-stranded complementary wild type
probes derived from wild type sequence. Wild type probes can be restriction site
probes, fragmenting probes, or capture probes comprising wild type nucleotide
sequence that when hybridized to a complementary mutation-containing region of a
target nucleic acid results in a base mismatch bulge or loop structure. A base
mismatch will be created at the location of the mutation. In one embodiment, the
mutation-containing positive strand is hybridized to a complementary wild type
probe that comprises the entire negative strand. In the preferred embodiment, the
complex of mutation-containing positive strand hybridized to one or more
complementary, wild type nucleic acid probes is fragmented using either
restriction endonucleases, or fragmenting probes coupled with a single-strand-
specific cleavage reagent. Any base mismatch regions between the set of wild
type probes and the set of NLFs can be specifically cleaved using one or more
mismatch-specific cleaving reagents. Examples of these reagents include:
structure-specific endonucleases such as T4 endonuclease VII, RuvC, MutY, or the
endonucleolytic activity from the 5'-3' exonuclease subunit of thermostable DNA
polymerases; single-strand-specific enzymes such as Mung bean nuclease, S1
nuclease or RNase A; and single-strand-specific chemistries such as
hydroxylamine, osmium tetroxide, potassium permanganate, or peroxide
modification of unpaired bases followed by a backbone cleaving oxidation step.

This mismatch-specific cleavage is used to cleave the mutation-containing 
nonrandom length fragment at the site of the mutation, thus producing two smaller 
fragments from the larger mutation-containing fragment. This approach is an 
efficient and simple way to identify the exact location of a mutation as well as its 
type. The mismatch-specific cleavage used in combination with one of the 
nonrandom fragmentation methods described herein can be used to fragment a
large (>200 bases), single-stranded target nucleic acid into a set of smaller, mass
resolvable nonrandom length fragments.
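
The effect of a mismatch-specific cut on a single fragment can be sketched as
below. The sketch is idealized: it places the cut a fixed number of bases from
the mismatch through a hypothetical `offset` parameter, whereas real reagents
cut at or near the mismatch at positions that depend on the reagent. The
sequences and function names are also hypothetical.

```python
def mismatch_positions(wild, mutant):
    """Indices at which two equal-length, same-sense sequences differ."""
    return [i for i, (w, m) in enumerate(zip(wild, mutant)) if w != m]


def cleavage_products(fragment, position, offset=1):
    """Split a fragment offset bases past the start of the mismatched base.

    offset=1 places the cut immediately 3' of the mismatched base
    (an illustrative assumption, not a property of any named enzyme).
    """
    cut = position + offset
    return fragment[:cut], fragment[cut:]


# Hypothetical 32-base fragment with a point mutation at the eighth base.
wild = "ACGT" * 8
mutant = wild[:7] + "A" + wild[8:]

site = mismatch_positions(wild, mutant)[0]
upstream, downstream = cleavage_products(mutant, site)
print(site, len(upstream), len(downstream))
```

The two product lengths add up to the length of the parent fragment, so the
pair of new, smaller masses points back to the position of the mismatch
within the known wild type sequence.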

Like EMC and CCM, the mismatch-specific cleavage approach utilizes a 
mismatch targeting reagent to cut at the point of mutation. The approach 
described herein improves upon the gel electrophoresis-based methods by focusing 
on relatively small fragments that take maximum advantage of the mass 
spectrometer's ability to detect the exact size of a fragment leading to the
identification of the exact location and nature of a mutation. The EMC and CCM
methods must be followed by DNA sequencing in order to fully characterize a
mutation. Using the methods described herein, a mutation in a target nucleic acid 
can be detected and its location and nature determined without any sequencing. 

An example of how a structure-specific enzyme like T4 endonuclease VII 
can be used for mismatch-specific cleavage is shown in FIG. 10. The first step 
involves two amplification reactions. First, a target nucleic acid suspected of
containing a mutation is amplified. Second, the corresponding wild type target
nucleic acid is amplified to create wild type probes. These two amplification
reactions can be performed together in one tube if the target nucleic acid is a
heterozygous mixture of mutant and wild type. For certain diagnostic procedures,
it may be more efficient to produce the wild type probes separately prior to the
screening process. The next steps involve fragmentation of the target nucleic acid,
e.g. a multiple digest of the target nucleic acid using more than one restriction
endonuclease, and a step in which the fragments are mixed, denatured, and then
annealed. The fragmentation and denaturing/annealing steps can occur in either
order. The purpose of the denaturing/annealing step is to produce a mixture of
hybrid target nucleic acids. In a 50:50 mixture of mutant target and wild type
nucleic acids, four different products result: 25% homozygous mutant double-
stranded nonrandom length fragments, 25% homozygous wild type double-stranded
nonrandom length fragments, and 25% each of the two forms of heterozygous
mutant/wild type hybrid nonrandom length fragments. See FIG. 10 (illustrating
the use of wild type NLFs as wild type probes to generate a base mismatch with
mutant NLFs). The heterozygous nonrandom length fragments contain at least one
base mismatch at the site of mutation, i.e. the point(s) of sequence variation
between mutant and wild type. The next step involves treatment of the nonrandom
length fragments with a mismatch-specific reagent that cleaves at the site of the
base mismatch in the heterozygous mutant/wild type nonrandom length fragments.
These new cleavages (the number of cleavage events will depend on the particular
enzyme used) typically reduce the nonrandom length fragment containing the
mutation into two smaller nonrandom length fragments. The 50% of the mixture
that contains the homozygous double-stranded nucleic acid fragments with no
mismatches will not be cleaved during the mutation-specific cleavage.
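
The 25% figures follow from random pairing of strands during annealing, which
can be made explicit in a short sketch. It assumes complete denaturation,
random reannealing, and equal amounts of positive and negative strands. The
function name and dictionary keys are hypothetical.

```python
def duplex_fractions(plus_mutant, minus_mutant):
    """Expected duplex composition after denaturing and random annealing.

    plus_mutant:  fraction of positive strands that carry the mutation.
    minus_mutant: fraction of negative strands that carry the mutation.
    """
    p, m = plus_mutant, minus_mutant
    return {
        "mut(+)/mut(-)": p * m,             # homozygous mutant
        "wt(+)/wt(-)": (1 - p) * (1 - m),   # homozygous wild type
        "mut(+)/wt(-)": p * (1 - m),        # heterozygous form
        "wt(+)/mut(-)": (1 - p) * m,        # heterozygous form
    }


# 50:50 mixture of mutant and wild type, both strands present equally.
even = duplex_fractions(0.5, 0.5)
print(even)
cleavable = even["mut(+)/wt(-)"] + even["wt(+)/mut(-)"]
print(f"fraction carrying a mismatch: {cleavable:.2f}")
```

With a 50:50 mixture each of the four duplex types makes up one quarter of
the population, so half of the duplexes carry a mismatch and half do not.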

Example schematic mass spectral plots are shown in FIG. 10B. An
expected spectrum would show a reduction in the peak size of the nonrandom
length fragment containing the base mismatch that is cleaved by the structure-
specific endonuclease (e.g. peaks 32+(Mut), 32+(Wt), 32-(Wt), and 32-(Mut))
and the introduction of several smaller peaks at lower masses than the mutant
peaks representing the set of heterozygous mutant/wild type NLFs that contain
base mismatches (see peaks 8+(Mut), 8+(Wt), 11-, 21-(Wt), 21-(Mut), and
24+). These peaks corresponding to the heterozygous NLFs containing base
mismatches are reduced in intensity but continue to be present since only 50% of
the molecules exist in the heterozygous form that can undergo the mutation-
specific cleavage.

It is possible to bias the population of the different
heterozygous/homozygous forms by performing the amplifications of the target
nucleic acid asymmetrically. Thus, one can maximize the types of nonrandom
length fragments yielding mutational data with the majority of the duplex formed
during the annealing process being heterozygous positive (+) strand mutant and
negative (-) strand wild type. 

While it is possible to observe similar patterns using gel electrophoresis 
techniques, the mass accuracy obtained by mass spectrometry provides the 
advantage of accurate determination of the nature of the mutation and the ability to
determine the size and order of the two nonrandom length fragments created by
the mutation-specific cleavage. In the example in FIG. 10B, the resulting
mismatch-specific cleavage fragments are represented by sizes 8, 11, 21, and 24
nucleotides in length. Using electrophoretic techniques, it would be impossible to
differentiate the two mutant forms at 8 and 21 (fragments 24+ and 11- do not
possess the mutant base and are identical in heterozygous forms C and D), nor
would it be possible to directly determine which fragment is upstream (toward the
5' end) and which fragment is downstream (toward the 3' end), e.g. in the positive
strand it is 8+ that is upstream from 24+. By providing exact mass values,
mass spectrometry allows these strands to be ordered based on mass value
database comparison with the fragments expected from the known sequence of the
wild type target nucleic acid. By completely identifying the location and nature of
the mutation this mass spectrometric method eliminates any need for sequencing
the target nucleic acid. 
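
The comparison of measured masses with the masses expected from a known
sequence can be sketched as follows. This is a minimal illustration, not the
procedure of the invention: the residue masses are commonly quoted approximate
average values, the presence of a 5'-phosphate on each fragment is an
assumption, and the sequences, fragment names and tolerance are hypothetical.

```python
# Approximate average masses (Da) of DNA nucleotide residues within a chain.
# Commonly quoted values, used here for illustration only.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}
TERMINAL_CORRECTION = -61.96  # strand with 5'-OH and 3'-OH ends
PHOSPHATE = 79.98             # added when a 5'-phosphate is assumed


def fragment_mass(seq, five_prime_phosphate=True):
    """Approximate average mass of a single-stranded DNA fragment."""
    mass = sum(RESIDUE_MASS[base] for base in seq) + TERMINAL_CORRECTION
    if five_prime_phosphate:
        mass += PHOSPHATE
    return mass


def assign_peak(observed_mass, expected, tolerance=3.0):
    """Names of expected fragments whose mass lies within the tolerance."""
    return [name for name, seq in expected.items()
            if abs(fragment_mass(seq) - observed_mass) <= tolerance]


# Hypothetical fragments: wild type and mutant 8-mers differ by one base.
expected = {
    "8+(Wt)": "ACGTTGCA",
    "8+(Mut)": "ACGTTGCG",
    "24+": "TTGACCGATGCATCGGATCCAGTA",
}

match = assign_peak(2505.6, expected)
print(match)
```

In this sketch the single A to G substitution shifts the 8-nucleotide fragment
by about 16 Da, which is enough to assign the observed peak to the mutant
fragment rather than the wild type fragment of identical length.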

FIG. 10B shows how the mismatch-specific cleavage event adds complexity
to the mass spectra. In the example shown, there are several locations where
2, 3, and even 4 different NLFs have the potential to overlap in the mass
spectrum, making the full spectrum difficult to resolve. As discussed
previously, and shown in FIG. 5, the mass spectra can be greatly simplified by
performing the mass spectrometric analysis on only the + or the - strands of
the nonrandom length fragments. For example, FIG. 11 shows the set of
nonrandom length fragments that are derived by analyzing only the positive (+)
strand of the mutant target nucleic acid. By eliminating the homozygous
nonrandom length fragments that are not mutation-specifically cleaved and
removing the negative strand from the mass spectrometric analysis, the total
number of nonrandom length fragments to be analyzed can be reduced from 20 to
7, with no two mass peaks having the same number of nucleotides. Of course, in
other situations, two peaks may be from nonrandom length fragments of the same
length depending on the type of mutation present, but such situations will be
infrequent.

This mismatch-specific cleavage, like the incorporation of mass-modified
nucleotides, extends the usable mass range of the initial target nucleic acid
for mass spectrometric analysis, since the primary mass accuracy needs are in
determining the reduced mass of the nonrandom length fragments created by the
mutation-specific cleavage and not in determining the mass of the other
nonrandom length fragments that are unaffected by the mutation-specific
cleavage.

It is not always necessary to fragment the target nucleic acid in tandem
with mismatch-specific cleavage if the size of the nonrandom length fragments
created by the mismatch-specific cleavage is small enough to fall into the
usable mass range with the necessary mass resolution and accuracy. Target
nucleic acids as large as 200 base pairs will yield at least one nonrandom
length fragment created by the mutation-specific cleavage wherein the
nonrandom length fragment can be a size less than 100 base pairs, e.g. a 200
bp target nucleic acid with a mutation at position 135 will produce nonrandom
length fragments of 65 and 135 after cleavage at the site of base mismatch.
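
The arithmetic of the 200 bp example can be written out as a short sketch. The
function name is hypothetical, and the sketch assumes a single cut exactly at
the mismatch position.

```python
def mismatch_cleavage_fragments(target_length, mismatch_position):
    """Lengths (upstream, downstream) after one cut at the mismatch site."""
    if mismatch_position not in range(1, target_length):
        raise ValueError("mismatch must lie inside the target")
    return mismatch_position, target_length - mismatch_position


upstream, downstream = mismatch_cleavage_fragments(200, 135)
print(upstream, downstream)

# The shorter fragment can never exceed half of the target length.
shorter = min(upstream, downstream)
```

For any target of up to 200 base pairs the shorter of the two fragments is at
most 100 base pairs long, which is why a single mismatch-specific cut can be
sufficient to bring one fragment into the usable mass range.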



Fragmentation Using Structure-Specific Endonucleases to Cleave a
Folded Target Nucleic Acid

Another nonrandom fragmentation method of the invention involves
providing a target nucleic acid that is either a positive or a negative single
strand; providing conditions permitting folding of the single-stranded target
nucleic acid to form a three-dimensional structure having intramolecular
secondary and tertiary interactions; and nonrandomly fragmenting the folded
target nucleic acid with at least one structure-specific endonuclease to form
a set of single-stranded nonrandom length fragments. A diagram of this
procedure is provided in FIG. 12. An example of conditions that permit folding
of the single-stranded target nucleic acid is heating to denaturation followed
by slow cooling to permit annealing to form a thermodynamically favored
secondary and tertiary structure. The structure-specific endonucleases
include: T4 endonuclease VII, RuvC, MutY, and the endonucleolytic activity
from the 5'-3' exonuclease subunit of thermostable DNA polymerases.

An alternative to the use of structure-specific endonucleases is the use of
some of the same single-strand-specific chemical cleavage procedures described
earlier in the text. Because of the higher frequency with which these reagents
might cleave relative to the structure-specific endonucleases, it is necessary
that the secondary and tertiary structures formed by the single-stranded
target be more compact, limiting the access of the chemical reagents to the
various reactive nucleotides. Approaches to forming these more compact
structures include performance of the reactions at lower temperature, under
higher salt conditions, or the use of RNA versus DNA, since RNA is known to
form more complete secondary and tertiary structures. Using this method, the
cleavage reaction can be run to completion to produce a standard set of
nonrandom length fragments or run only partially with the potential of
producing a nested set of products that can be analyzed by mass spectrometry
or by electrophoresis methods.

PURIFICATION METHODS

When analyzing nucleic acids, including nonrandom length fragments, by
mass spectrometry, there are several requirements that need to be met.

First, as has been described earlier, is the need to produce fragments within 
the resolvable range and high mass accuracy range of the mass spectrometer. 

Second is the need to eliminate from the sample nucleic acid fragments that
do not contribute to the analysis and may unnecessarily convolute the mass
spectra. With analysis methods such as gel electrophoresis, a mixture of
specifically labeled nucleic acid fragments (radioactively or fluorescently
tagged) can be visualized in the presence of other unlabeled nucleic acid
fragments that comigrate but are invisible and therefore do not convolute
analysis of the gel data. The mass spectrometric methods described herein do
not use any form of labeling that could render certain fragments invisible,
e.g. the negative strand in a double-stranded product, and it is therefore
necessary to remove such fragments prior to analysis.

Third is the need to produce samples of relatively high purity prior to
introduction to the mass spectrometer. The presence of impurities, especially
salts, greatly affects the resolution, accuracy and intensity of the mass
spectrometric signal. Contaminating primers, residual sample genomic DNA, and
proteins can all affect the quality of the mass spectra.

In addition to the three requirements listed above, it is also desirable for
the methods to be amenable to automation, fast and inexpensive, providing an
effective approach for detecting genetic mutations.

Existing purification methods are all designed to work with labeled
molecules that are typically analyzed by gel electrophoresis. As well as
utilizing labels, electrophoresis is, to a certain degree, tolerant of
impurities including salts and proteins. For mass spectrometric analysis,
prior art purification methods such as precipitation combined with vigorous
alcohol washes, filtering and dialysis, and ion exchange chromatography are
unsatisfactory because they cannot eliminate unwanted nucleic acid fragments
and normally do not remove all salts from a sample. Solid phase approaches
such as glass bead capture under high salt conditions, biotin/streptavidin
binding, direct solid-phase covalent linkage, and capture via hybridization to
solid phase bound oligonucleotide probes can be used to eliminate unwanted
nucleic acid fragments but typically require high levels of salt during many
of the wash steps, rendering the products less pure and compromised for mass
spectrometric analysis.

The purification methods of the present invention are better suited to mass
spectrometric analysis of nucleic acids than the prior art methods. First, the
methods herein physically isolate selected sets of nucleic acids from a
multiplicity of impurities, including undesirable nucleic acid fragments,
proteins, and salts, that would result in a poor quality mass spectrum.
Second, the methods optionally use a solution comprising volatile salts such
as ammonium bicarbonate, dimethyl ammonium bicarbonate or trimethyl ammonium
bicarbonate in any of the steps, including hybridization, endonuclease
digestion or washing. These two differences are significant advantages over
the prior art because: (1) physical separation of the desired set of nucleic
acid fragments for mass spectrometric analysis is better than the labelling
methods of the prior art that do not physically separate the target nucleic
acids from a variety of other impurities that interfere with an accurate mass
spectrum; and (2) the use of volatile salts in any of the steps precludes the
need for any wash step known in the prior art to merely remove salts or
inorganic ions.

Double-Stranded Fragment Capture Approaches

There are a number of basic ways to purify DNA restriction products from
salts and other small molecules, including precipitation, filtering, dialysis,
and ion exchange chromatography. While all of these methods are effective,
they are not all equally useful for removing amplification primers, residual
DNA, i.e. genomic DNA, or any proteins used. In addition, none of the basic
approaches meets all of the requirements of automation, speed and cost. The
approach that comes closest is the use of small ion exchange spin columns,
which are somewhat expensive and not simple to integrate into an automated
setup. These small ion exchange spin columns can, however, produce high
quality nucleic acids for mass spectrometric analysis. A better alternative is
the use of (magnetic) glass beads to capture/precipitate nucleic acids of a
specific size range and allow them to be rigorously washed. However, this
method, like all of the other prior art methods described above, does not
allow for the removal of unincorporated DNA primers, since they are of the
same size as the nonrandom length fragments to be analyzed and cannot be
simply differentiated.

Another general approach to purification of double-stranded fragments is to
directly capture the target nucleic acid and/or a set of nonrandom length
fragments by one of three means: (A) hybridization to capture probes
comprising a first binding moiety that specifically binds to a second binding
moiety attached to a solid phase; (B) binding the target nucleic acid or the
members of the set of NLFs, each comprising a nucleotide sequence and a first
binding moiety, to a second binding moiety attached to a solid phase; or (C)
direct covalent attachment of the target nucleic acid or the members of the
set of NLFs to the solid support. Each of these methods has advantages and
disadvantages.

(A) Hybridization to solid support bound capture probes is straightforward,
specific, and can be made thermodynamically and kinetically favored by
optimizing the size and concentration of the capture probes. Optimization is
necessary since the set of NLFs would generally prefer to hybridize to their
complements rather than to the capture probes. (This approach also works well
for single-strand isolation as described in the following section.) A
variation is to bind the probes to the solid phase after hybridization to
target. Both biotin/streptavidin and covalent approaches for linking the
probes to the solid phase are feasible. The principal concern with this
approach is that maintenance of the hybridization, especially during wash
steps, requires relatively high levels of salts and makes it more difficult to
produce a salt-free product for mass spectrometric analysis. Solutions to this
problem include the use of relatively long capture probes to increase melting
temperatures or the use of volatile salts that can be removed prior to mass
spectrometric analysis. The use of volatile salts is described in more detail
elsewhere.

(B) Biotin coupling to streptavidin (or avidin) requires that any target
nucleic acid or nonrandom length fragment to be captured contain a biotin. It
is straightforward to capture the target nucleic acid because biotinylated
primers can be used in the PCR amplification. In order to capture all of the
fragments after a restriction digest, it is necessary to incorporate biotin
into all of the fragments. Three possible routes for biotin labeling are: (1)
the inclusion of a biotinylated nucleoside triphosphate during fragment
synthesis, (2) the use of a DNA polymerase to fill in at 5' restriction
overhangs using a biotinylated nucleoside triphosphate, and (3) the use of
ligase to ligate a biotinylated oligonucleotide at the restricted ends of the
nonrandom length fragments, where the oligonucleotides are either
complementary to the restriction sequence overhangs or are capable of blunt
end ligation.

Each of the three approaches has its problems but is feasible. Biotins
incorporated in method (1) may inhibit the restriction endonucleases to be
used and prevent the use of structure-specific nucleases in a second
mutation-specific step, since the biotins may be recognized as DNA
modifications to be excised. Method (2) is more feasible but requires a
preliminary cleanup step to exchange the normal triphosphates for biotinylated
ones, and restriction sites are limited to those of enzymes that produce 5'
overhangs. Method (3) is more generalizable than (2); its principal weakness
is competition with larger fragments that will tend to religate. However, this
competition can be overcome by using an excess of the biotinylated linkers.

(C) The approach of direct covalent attachment of NLFs or target to a solid
support faces many of the same challenges as the biotin/streptavidin approach
but also includes the need to design specific, "hot" (i.e. fast and efficient)
binding chemistry that works with low concentrations of material.

The target or members of a set of NLFs can be covalently attached to a
solid support using any of the number of methods commonly employed in the art
to immobilize an oligonucleotide or polynucleotide on a solid support. The
target or NLFs covalently attached to the solid support should be stable and
accessible for base hybridization.

Covalent attachment of the target or NLFs to the solid support may occur
by reaction between a reactive site or a binding moiety on the solid support
and a reactive site or another binding moiety attached to the target or NLFs,
or via intervening linkers or spacer molecules, where the two binding moieties
can react to form a covalent bond. Coupling of a target or NLF to a solid
support may be carried out through a variety of covalent attachment functional
groups. Any suitable functional group may be used to attach the target or NLF
to the solid support, including disulfide, carbamate, hydrazone, ester,
N-functionalized thiourea, functionalized maleimide, streptavidin or
avidin/biotin, mercuric-sulfide, gold-sulfide, amide, thiolester, azo, ether
and amino.

The solid support may be made from the following materials: cellulose,
nitrocellulose, nylon membranes, controlled-pore glass beads, acrylamide gels,
polystyrene, activated dextran, agarose, polyethylene, functionalized
plastics, glass, silicon, aluminum, steel, iron, copper, nickel and gold. Some
solid support materials may require functionalization prior to attachment of
an oligonucleotide or capture probe. Solid supports that may require such
surface modification include wafers of aluminum, steel, iron, copper, nickel,
gold, and silicon. Solid support materials for use in coupling to a capture
probe include functionalized supports such as the 1,1'-carbonyldiimidazole
activated supports available from Pierce (Rockford, IL) or functionalized
supports such as those commercially available from Chiron Corp. (Emeryville,
CA). Binding of a target or NLF to a solid support can be carried out by
reacting a free amino group of an amino-modified target or NLF with the
reactive imidazole carbamate of the solid support. Displacement of the
imidazole group results in formation of a stable N-alkyl carbamate linkage
between the target or NLFs and the support.

The target or NLFs may also be bound to a solid support comprising a gold
surface. The target or NLFs can be modified at their 5'-end with a linker arm
terminating in a thiol group, and the modified target or NLFs can be
chemisorbed with high affinity onto gold surfaces (Hegner, et al., Surface
Sci. 291:39-46 (1993b)).




In all of the methods in which a solid-phase approach is used, the
double-stranded nonrandom length fragments can be rigorously washed to remove
deleterious contaminants. Following washing, it is necessary to release these
fragments from the solid support for mass spectrometric analysis. The
isolation of a set of NLFs may be performed on the same plate that is used
within the mass spectrometer. Both the capture probe hybridization and
biotin/streptavidin approaches can use heat and/or pH denaturation to disrupt
the noncovalent interactions and afford release of the set of NLFs bound to
the solid support. Alternatively, a cleavable linkage can be incorporated
between the first binding moiety and the NLFs. Any covalent coupling chemistry
will need to be reversible, or it will be necessary to include a separate
chemically cleavable linkage somewhere within the bound product. It may also
be useful to use a chemically cleavable linkage approach with the
biotin/streptavidin strategies so that release of the double-stranded
fragments can be performed under relatively mild conditions. In all cases the
cleavable linkage can be located within the linker molecule connecting the
biotin and the base (e.g. a disulfide bond in the linker), within the base
itself (e.g. a more labile glycosidic linkage), or within the phosphate
backbone linkage (e.g. replacement of phosphate with a phosphoramidate).

One alternative to the solid-phase approaches described above is to
capture the target nucleic acids prior to nonrandom fragmentation with one or
more restriction endonucleases. Rigorous washes to remove polymerase, salts,
primers and triphosphates required for amplification are followed by treatment
with minimal amounts of restriction enzyme under very low salt conditions.
This mixture is then directly analyzed in the mass spectrometer. Mass
spectrometry can tolerate salts if their concentrations are low enough, and a
limited class of restriction enzymes can work under very low salt conditions.

The low salt approach does limit the restriction sites that can be cleaved as
part of the methods of detecting mutations. Many restriction endonucleases
require a significant level of salt. An attractive alternative to limiting the
restriction endonuclease cleavage reactions to low levels of salt is to
replace the salts normally used with volatile salts. These salts, such as
ammonium bicarbonate, dimethylammonium bicarbonate or trimethylammonium
bicarbonate, can be removed prior to mass spectrometric analysis through
simple evaporation. Evaporation can be accelerated by placement of the sample
in a vacuum, such as the mass spectrometer sample chamber, or by heating the
sample.

Approaches to Capturing Single-Stranded Fragments

As described earlier, analysis of single-stranded nonrandom length
fragments is generally preferable since it provides a complete set of data
with the minimal number of fragments and therefore simplifies the spectra and
facilitates an increase in the total length of nucleic acid that can be
analyzed in a single assay. A number of approaches, as described above, can be
taken toward the production of single-stranded fragments and their
purification, which includes the elimination of undesired fragments.

If DNA restriction endonucleases are used to produce the nonrandom length
fragments, it is necessary that the target nucleic acid have a double-stranded
form prior to restriction, or more specifically, that the restriction
endonuclease recognition sites be located in double-stranded DNA. The
alternative to having fully double-stranded DNA prior to restriction is to
hybridize restriction site probes to single-stranded DNA, wherein the
restriction site probes are complementary to the restriction sites for
selected restriction endonucleases.

The basic known methods for DNA isolation - precipitation, dialysis,
filtration and chromatography - do not isolate single-stranded from
double-stranded DNA. If these purification methods are employed, it is
necessary to add a separate step where single-strand isolation is performed.

Isolation of a set of single-stranded NLFs can be accomplished using a set
of capture probes. "Capture probes" are oligonucleotides or polynucleotides
comprising a single-stranded region complementary to at least one nucleotide
sequence of the single-stranded NLFs to be isolated and a first binding
moiety. The first binding moiety is capable of covalent or noncovalent binding
to a second binding moiety attached to a solid support. The capture probes can
comprise a set of capture probes, each of which contains single-stranded
regions complementary to a corresponding member of a set of NLFs. A capture
probe can also comprise a full-length single-stranded target nucleic acid that
is complementary to the nucleotide sequences of the members of a set of NLFs.
The capture probes can be bound to a solid support using the methods described
above for binding a target or set of NLFs to a solid support.

If restriction endonucleases are used to produce nonrandom length
fragments from DNA, the preferred method for isolating single-strand fragments
from these products is to use a select set of capture probes. In one
embodiment the capture probe consists of either full length positive or full
length negative strand, where the strand has been modified to contain a
solid-phase binding moiety. The process using full length negative strand
modified to contain a biotin at the 5' end is illustrated in FIG. U. The
capture probe is made and the target nucleic acid is fragmented in two
separate reactions. Following inactivation of the restriction enzymes, the
probe and double-stranded fragments are mixed, denatured and annealed,
producing a hybrid product of positive strand fragments annealed to full
length negative strand capture probe. The capture probe can be bound to the
solid phase via a biotin-streptavidin interaction prior to or following
formation of the probe/fragment hybrid. Following the necessary wash steps,
the fragments are released and analyzed by mass spectrometry. Optionally, the
fragments can be probed for a mutation-specific base-base mismatch and
fragmented using one of the mismatch-specific reagents described earlier.
Illustrations of the different spectra produced without and with the optional
second step are shown in FIG. 13. Note that after mutation-specific,
mismatch-specific cleavage, fragments that are distal from the solid phase
binding site will be released into solution and washed away, and therefore not
analyzed. Loss of these fragments can enhance the ability of mass spectrometry
to quickly and easily identify the site of mutation.

An alternative approach to using restriction endonucleases is the use of
fragmenting probes. These have been described in detail above, and allow the
use of a target nucleic acid consisting of either DNA or RNA. The final
products, using fragmenting probes and single-strand-specific nucleases, are
double-stranded and thus, without any additional steps, do not themselves
produce the set of single-stranded, nonrandom length fragments necessary for
analysis. However, there are several approaches that can be used to yield
single-stranded nonrandom length fragments.




The first approach for producing single-stranded nonrandom length
fragments is useful when the target is RNA and the probes are DNA or vice
versa. In this case, the double-stranded products are RNA/DNA hybrids and can
be selectively treated with either a DNA or RNA specific nuclease to yield the
opposite NLF intact. Acid or base treatments are also an option. These
single-stranded products can then be isolated using a number of conventional
methods described above.

A second approach to producing single-stranded products for mass
spectrometry is to attach the size and sequence specific capture probes to a
solid support before or after hybridization to the target nucleic acid and the
single-strand-specific cleavage. Since the probes are bound to the solid
phase, it becomes possible to capture, wash, and then selectively release the
nonrandom length target fragments as single-stranded molecules. Following any
wash steps, the nonrandom length target fragments are removed from the solid
support by denaturation of the double-stranded complex. Once released, the
single-stranded fragments can be directly analyzed by the mass spectrometer.

One of skill in the art will know how to use capture probes to capture
single strands of a set of NLFs to a solid support in all the embodiments of
this invention. For example, biotinylated capture probes can be used to
capture single-stranded fragments following cleavage of the target nucleic
acid with restriction endonucleases (optionally after neutralizing the
restriction endonucleases). The use of capture probes provides a relatively
high level of flexibility to select which set of NLFs to analyze at any given
time. Large capture probes, capable of hybridizing to all or several different
fragments, can be used to capture the fragments correlating to one strand of a
target nucleic acid, e.g. a capture probe that is full length negative strand.
A short capture probe or combinations of shorter capture probes can be used to
selectively choose particular fragments from either strand to analyze in a
given mass spectrometric sample. For example, if several fragments share
similar sizes it might be preferable to analyze them separately.

As another embodiment, a full length target nucleic acid can be captured
before restriction digestion using a capture probe that is nuclease resistant.
In this case it is necessary to modify the capture probe, typically by
changing the backbone composition from phosphate to a phosphorothioate, methyl
phosphonate or borano-phosphate. [Uhlmann and Peyman, "Antisense
Oligonucleotides: A New Therapeutic Principle," Chemical Reviews 90(4):543-584
(1990) (incorporated by reference herein)] These forms of modification limit
cutting on the probe strand, resulting only in the nicking of the target
molecule to create sequence-specific, nonrandom length fragments without
creating any double-stranded breaks. By leaving the modified probe strand
intact, it is possible to quickly capture the nonrandom length fragments to
the solid phase and purify them for mass spectrometric analysis.

All of these isolation or purification methods can be utilized in cases where
a mutation-specific cleavage event is utilized. In order to present a base
mismatch mutation for cleavage, a heterozygous, double-stranded molecule must
be present. Typically this means that the fragmenting probe is composed of the
wild type sequence and is hybridized to the target nucleic acid fragments
containing the potentially mutated target nucleic acid.

Volatile Salts

The methods of this invention include the use of volatile salts, which is an
innovative alternative to NaCl, MgCl2, or other commonly used salts. Volatile
salts are any salts that completely evaporate, leaving little or no salt
residue in the sample to be analyzed in the mass spectrometer, for example,
the isolated set of NLFs. Volatile salts useful in the methods described
herein include ammonium bicarbonate, dimethyl ammonium bicarbonate and
trimethyl ammonium bicarbonate. These volatile salts are useful in many
different aspects of the methods described herein, including use in
hybridizing of nucleic acids, washing nucleic acids to remove impurities, and
digestion of nucleic acids with endonucleases or other enzymes. Rather than
performing washes at reduced levels of nonvolatile salts, which might cause
the nonrandom length target fragments to denature from a solid support bound
oligonucleotide probe, it is a preferred embodiment to wash support-bound
nonrandom length fragments in the presence of relatively high levels of
NH4HCO3, e.g. 100 mM, and then to evaporate the volatile salt prior to
analysis by mass spectrometry. Volatile salts are useful for buffer exchange
in all cases where nucleic acids are to be analyzed by mass spectrometry.

Solid phase purification schemes involving DNA hybridization commonly
described in the literature do not focus on the removal of salts since gel
electrophoresis techniques are much more tolerant of salts than mass
spectrometry. [S. Wang, M. Krinks & M. Moos "DNA Sequencing from Single Phage
Plaques using Solid-Phase Magnetic Capture" Biotechniques H, 130 (1995); R.
Sandaltzopoulos & P. Becker "Solid-Phase DNase I Footprinting" Boehringer
Mannheim Biochemica 4, 25 (1995); both incorporated by reference herein]
These methods are primarily focused on the removal of strands complementary to
template prior to enzymatic reaction and/or enzymes and unincorporated labeled
nucleotides or primers following reaction. In such schemes residual salt
levels can be as high as 100 mM NaCl and 25 mM MgCl2. Mass spectrometry is
intolerant of salt concentrations of this level. [T. Shaler et al. "Effect of
Impurities on the Matrix-Assisted Laser Desorption Mass Spectra of
Single-Stranded Oligodeoxynucleotides" Anal. Chem. 68, 576 (1996)] The methods
described herein using volatile salts provide an innovative approach to
isolating and handling target nucleic acids and/or nonrandom length fragments
for mass spectrometric analysis.

The volatile salts can be removed from the sample prior to mass
spectrometric analysis by evaporation. Evaporation of the volatile salts can
be enhanced using a variety of methods, including use of vacuum, heating,
laminar flow of a dry gas over the sample, or, in the case of ammonium
bicarbonate (or dimethyl- or trimethylammonium bicarbonate), reduction of the
pH by addition of an acid, including 3-HPA, which can speed up the
decomposition of the salt into ammonia (or dimethyl- or trimethylammonia) and
carbon dioxide. Volatile salts can be used in a variety of methods, beyond
those described here, for preparing samples of any number of organic
molecules, including proteins, polypeptides, and polynucleotides, for mass
spectrometric analysis.

Each of the nonrandom fragmentation techniques described herein can be
used in combination with any of the isolation methods also described herein.
Moreover, the nonrandom fragmentation techniques can be used in combination
with each other, as one of ordinary skill in the art using the techniques
described herein will know how to combine the different aspects of the
invention. For example, the mutation-specific cleavage technique can be
combined with a set of restriction endonuclease-cleaved NLFs. All of these
methods and combinations thereof can optionally include use of mass-modified
nucleotides, internal calibrants and volatile salts.

The kits described above for nonrandomly fragmenting target nucleic acids
and detecting mutations in one or more target nucleic acids can also contain a
combination of different means of nonrandomly fragmenting the target nucleic
acids as well as different means of isolating the nonrandom length fragments
that are to be analyzed by mass spectrometry.

The following examples are provided to illustrate embodiments of the
invention, but do not limit the scope of the invention.

Example 1. PCR Amplification of Source Nucleic Acids.

PCR methods have been extensively developed during the last decade. An
example protocol is as follows. A sample containing 10-10,000 copies of a source
DNA molecule is mixed with two antiparallel DNA primers that surround a targeted
sequence, e.g. the coding region for a gene involved in carcinogenesis. The PCR
mix is composed of: 8 µl 2.5 mM deoxynucleoside triphosphates, 10 µl 10X PCR
buffer, 10 µl 25 mM MgCl2, 3 µl 10 µM forward primer, 3 µl 10 µM reverse primer,
0.3 µl thermostable Taq DNA polymerase, 64.7 µl H2O, and 1 µl source DNA. The
sample tube is sealed and placed into a thermal cycling device. A typical cycling
protocol is as follows:



Step 1    95°C     2 min.
Step 2    95°C     15 sec.
Step 3    55°C     15 sec.
Step 4    72°C     1 min.
Step 5    repeat Steps 2-4 35 times
Step 6    72°C     5 min.
Step 7    stop
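
The arithmetic behind the reaction mix and cycling protocol of Example 1 can be checked with a short sketch. The component volumes and step times are taken from the example; the function names are illustrative only, and the reading that Steps 2-4 are executed 35 times in total (rather than once plus 35 repeats) is an assumption, as is ignoring ramp time between temperatures.

```python
# Volumes (in microliters) of the Example 1 reaction mix.
COMPONENTS_UL = {
    "dNTP mix (2.5 mM)": 8.0,
    "10X PCR buffer": 10.0,
    "MgCl2 (25 mM)": 10.0,
    "forward primer (10 uM)": 3.0,
    "reverse primer (10 uM)": 3.0,
    "Taq DNA polymerase": 0.3,
    "H2O": 64.7,
    "source DNA": 1.0,
}


def total_volume_ul(components):
    """Sum of all component volumes in microliters."""
    return sum(components.values())


def final_concentration(stock, volume_ul, total_ul):
    """Concentration after dilution into the full reaction (same unit as stock)."""
    return stock * volume_ul / total_ul


def cycling_time_s(initial_s=120, cycle_steps_s=(15, 15, 60), cycles=35, final_s=300):
    """Programmed hold time in seconds; ramp times are not included (assumption)."""
    return initial_s + cycles * sum(cycle_steps_s) + final_s


total = total_volume_ul(COMPONENTS_UL)
print(f"total volume: {total:.1f} ul")
print(f"final dNTP:   {final_concentration(2.5, 8.0, total):.2f} mM")
print(f"final MgCl2:  {final_concentration(25.0, 10.0, total):.2f} mM (added MgCl2 only)")
print(f"final primer: {final_concentration(10.0, 3.0, total):.2f} uM each")
print(f"hold time:    {cycling_time_s() / 60:.1f} min")
```

The listed volumes add up to a 100 µl reaction, which is why the 10X buffer is used at 10 µl.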






Example 2. Production of Single-Stranded Nucleic Acids by Asymmetric PCR.

The basic PCR procedure can be modified in order to produce predominantly
one of the two strands. These asymmetric procedures involve modifying the ratio
of the two primers; a typical ratio is 10:1.

Example 3. Production of Single-Stranded DNA via Biotinylated PCR Products.

For the preparation of capture probes one of the two primers can be
synthesized with a biotin moiety internally or at the 5' end of the oligonucleotide.
Following a standard PCR, the double-stranded product can be bound to a solid-phase
surface coated with streptavidin. For example, 10 pmol of double-stranded PCR
product is mixed with 5 µl MPG [10 mg/ml] paramagnetic streptavidin-coated beads
in a binding/washing buffer of 2.0 M NaCl, 10 mM TrisCl, 1 mM EDTA, pH 8.0.
The solution is incubated for 15 min. at room temperature with mixing. Following
incubation the tube is placed next to a high field, rare earth magnet and the
paramagnetic beads with the bound biotinylated PCR product are precipitated to the
wall of the tube. The supernatant is removed, and the particles, outside the influence
of the magnetic field, are resuspended into binding/washing buffer. The beads and
wash solution are mixed and then subjected once again to the magnetic field to
precipitate the magnetic particles. The supernatant is once again removed and either
the wash step is repeated or the alkaline denaturation step commences. In order to
release the unbiotinylated strand from the double-stranded product the beads are
mixed with an alkaline denaturation solution, 0.1 M NaOH. The beads are incubated
at room temperature for 10 min., which denatures the PCR product and releases the
unbiotinylated product into solution. The biotinylated strand, bound to the magnetic
beads, is precipitated from the solution under the magnetic field and the unbiotinylated
strand, now single-stranded, is transferred to a new tube with the supernatant. In an
optional secondary step, the now single-stranded biotinylated strand can be freed from
the magnetic beads by boiling the beads in water for 10 min. and transferred with the
new supernatant after magnetic precipitation of the magnetic beads.




Example 4. Mass Modification of Target Nucleic Acids.

Mass modification of the target nucleic acid is performed during the
amplification step. One or more standard deoxynucleoside triphosphates are replaced
with modified deoxynucleoside triphosphates. As an example thymidine is replaced
with a 5-alkynyl-substituted-2'-deoxyuridine triphosphate. Because the modified
nucleotides may not be efficient substrates for DNA polymerase it may be necessary
to increase the concentration of the corresponding triphosphate by a factor of 2 to 100
over normal levels.

Example 5. Nonrandom Fragmentation of Double-Stranded Target Nucleic
Acids Using Restriction Endonucleases.

Specifically-sized, double-stranded DNA products produced, for example, by
PCR are subjected to sequence-specific fragmentation using restriction endonucleases.
As an example, 10 pmoles of a 500 base pair PCR product is treated with one unit
each of the frequently cutting enzymes Mnl I and HinP I in the buffer recommended
by the enzyme supplier. The reaction is incubated at 37°C for 1 hour, followed by
an enzyme-denaturing incubation at 65°C for 15 min.
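
As a rough illustration of why such a digest yields a predictable set of fragment lengths, the sketch below cuts the top strand of a sequence at every occurrence of a recognition site. It is a minimal sketch, not part of the described protocol: the toy sequence is invented, the `digest` helper is hypothetical, and the HinP I site (GCGC, cut after the first G) is supplied from general knowledge rather than from this text. Mnl I is left out because it cuts at a distance from its recognition site, which this simple model does not handle.

```python
def digest(seq, sites):
    """Cut the top strand of `seq` at every occurrence of each recognition site.

    `sites` maps an enzyme name to (recognition sequence, cut offset), where the
    offset is the number of bases of the site that lie before the cut.
    """
    cuts = set()
    for recognition, offset in sites.values():
        start = seq.find(recognition)
        while start != -1:
            cuts.add(start + offset)
            start = seq.find(recognition, start + 1)
    # Fragment boundaries: both ends plus every internal cut position.
    bounds = [0] + sorted(c for c in cuts if 0 < c < len(seq)) + [len(seq)]
    return [seq[a:b] for a, b in zip(bounds, bounds[1:])]


# Assumed site definition (not stated in the text): HinP I cuts G^CGC.
SITES = {"HinP I": ("GCGC", 1)}

toy_product = "AAGCGCTTTTGCGCAA"  # invented sequence for illustration
fragments = digest(toy_product, SITES)
print(fragments)                    # ['AAG', 'CGCTTTTG', 'CGCAA']
print([len(f) for f in fragments])  # [3, 8, 5]
```

Because the cut positions depend only on the sequence, a point mutation changes either the mass of one fragment or, if it creates or destroys a site, the fragment pattern itself.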

Example 6. Nonrandom Fragmentation of Single-Stranded Target Nucleic
Acids Using Small Oligonucleotide Restriction Site Probes in
Combination with Restriction Endonucleases.

Single-stranded DNA target, produced, for example, by asymmetric PCR or
by the solid phase methods described in Example 3, is mixed with small
oligonucleotide restriction probes complementary to selected restriction site locations.
As an example, a set of 10 base long probes targeting the Hae III recognition
sequence is synthesized with the sequence

(SEQ ID NO: 1) 5' NNNGGCCNNN 3', where the N's are chosen to allow the
restriction site probes to fully complement the single-stranded target DNA at the sites
where the Hae III recognition site occurs (e.g. the probe (SEQ ID NO: 2) 5'
GACGGCCAAA 3' to complement the target sequence (SEQ ID NO: 3) 5'
...TTTGGCCGTC... 3'). The mixture of target and probes, dissolved in the
restriction buffer to be used in the cleavage step, is denatured at 95°C and then
incubated at 32°C (the average Tm melting temperature for the probes) for 15 min.,
allowing the probes to anneal to target and producing a mixture of single-stranded and
double-stranded regions within the target nucleic acid. The hybridized product is then
cleaved at the double-stranded sites using one or more specific restriction
endonucleases (e.g. Hae III), under conditions similar to those described in Example
3.
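
The relationship between a target site and its restriction site probe can be sketched as follows. The GGCC site, the three flanking bases on each side, and the SEQ ID NO: 2 and 3 sequences come from the example above; the function names are hypothetical and the code only illustrates the complementarity, not a probe design procedure from the source.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def restriction_site_probes(target, site="GGCC", flank=3):
    """Build one probe per site occurrence: the site plus `flank` bases on each
    side of the target, reverse-complemented so that it anneals to the target."""
    probes = []
    start = target.find(site)
    while start != -1:
        lo, hi = start - flank, start + len(site) + flank
        if lo >= 0 and hi <= len(target):  # skip sites too close to an end
            probes.append(reverse_complement(target[lo:hi]))
        start = target.find(site, start + 1)
    return probes


target_region = "TTTGGCCGTC"                   # SEQ ID NO: 3
print(restriction_site_probes(target_region))  # ['GACGGCCAAA'], i.e. SEQ ID NO: 2
```

Because GGCC is its own reverse complement, every probe built this way carries the 5' NNNGGCCNNN 3' pattern of SEQ ID NO: 1.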

Example 7. Nonrandom Fragmentation of Single-Stranded Target Nucleic
Acids Using Fragmentation Probes in Combination with Single-
Strand-Specific Endonucleases.

Single-stranded DNA target, produced, for example, by asymmetric PCR or
by the solid phase methods described in Example 3, is mixed with fragmenting
probes complementary to the target DNA. As an example, a mixture of probes with
sizes of 24, 26, 28, 30, 32, and 34 bases is used, each with a sequence complementary
to a different, nonoverlapping region of the single-stranded target DNA. The mixture
of target and probes, dissolved in S1 nuclease digest buffer comprised of 50 mM
NaAcetate pH 4.5, 280 mM NaCl, 50 mM MgCl2, and 4.5 mM ZnSO4, is denatured
at 95°C and then incubated at 55°C (the average Tm for the probes) for 15 min.,
allowing the probes to anneal to target and producing a mixture of single-stranded
and double-stranded regions within the target nucleic acid. The hybridized product
is then digested in the single-stranded regions using 1 U S1 nuclease per ng target
DNA, incubated at room temperature for 30 min.

Example 8. Nonrandom Fragmentation of Single-Stranded Target Nucleic
Acids Using Mismatch-Specific Cleavage.

Example 8.1. Chemical Cleavage at Mismatched Cytosine

A heterozygous, mutation-containing DNA target is produced, either by PCR
of a heterozygous source nucleic acid or by hybridization of wild-type probes to a
mutation-containing single-stranded target DNA. For solid phase capture and
purification protocols the DNA probes are synthesized either chemically or
enzymatically in such a way as to contain biotin moieties. By either route, when a
mutation is present a mismatch forms between the target and wild type. A cleavage
solution of hydroxylamine is prepared by dissolving 1.39 g of hydroxylamine
hydrochloride in 1.6 mL of warm H2O followed by the dropwise addition of 1.75 mL
of diethylamine to yield a solution of pH 6. A 6 µL sample of double-stranded DNA
containing a mismatch site is mixed with 20 µL of hydroxylamine solution and the
resulting solution is incubated at 37°C for 30 minutes. The reaction is stopped by the
addition of 374 µL of H2O and the solution is removed either by solid phase capture
of the reaction products using magnetic beads with washes performed in a similar
manner to that described in Example 3 or by multistep centrifugation in a
Microcon-30 ultrafiltration unit (Amicon). The reaction products are redissolved in
45 µL of H2O and 5 µL of piperidine is added. The solution is incubated at 90°C for
30 minutes and then placed on ice to cool. A 300 µL portion of H2O is added and
samples are either evaporated to dryness or purified by one of the two methods
described in Examples 9 and 10.

A typical mass spectrum obtained from the hydroxylamine fragmentation at
a point mutation is shown in FIG. 14. The source DNA in this case is a section of
the coding sequence for the p53 gene. A 134 base long PCR product is produced as
in Example 1, amplifying p53 from codon 188 to 233 containing a heterozygous point
mutation in codon 213, CGA->TGA. The forward primer contains a 5'-biotin and
a chemically labile linker within the primer; the reverse primer is a standard
oligonucleotide. The mismatch-containing PCR product is treated with hydroxylamine
as described above, cleaving the mismatch at C in codon 213. The product is
purified as described in Example 10, and analyzed as described in Example 11. A
strong peak appears at the mass correlating to a product 75 bases in size, identifying
that a C is present in a mismatch in the first position of codon 213. An analysis of
mutation-free wild type, shown in FIG. 15, contains no mismatch and therefore no
cleavage occurs.

Example 8.2. Chemical Cleavage at Mismatched Thymine

DNA is obtained in a similar manner to Example 8.1. The modification
reagent is a 20 mM solution of KMnO4 in deionized H2O. To 6 µL of double-
stranded DNA containing a mismatch site is added 14 µL of the modification
reagent. The solution is mixed gently at room temperature over the course of two
minutes, during which time the solution turns slightly brown. A 20 µL portion of a
solution consisting of 1.25 M sodium acetate pH 8.5 and containing 1 M
2-mercaptoethanol is added to stop the reaction, which results in the solution becoming
immediately colorless. A 360 µL portion of H2O is added and the solution is either
spun through a Microcon-30 ultrafiltration unit 2X, collected, and then evaporated to
dryness or taken through a solid phase capture and wash protocol. The DNA is
redissolved in 45 µL of H2O and 5 µL of piperidine is added. The resulting solution
is heated to 90°C for 30 minutes and then placed on ice to cool. After it cools, the
solution is diluted by the addition of 300 µL of H2O and then evaporated to dryness.
As an alternative the cleavage products can be purified by one of the two methods
described in Examples 9 and 10.

A typical mass spectrum obtained from the KMnO4 fragmentation at a point
mutation is shown in FIG. 16. The source DNA in this case is a section of the
coding sequence for the p53 gene. A 134 base long PCR product is produced as in
Example 1, amplifying p53 from codon 188 to 235 containing a heterozygous point
mutation in codon 213, CGA->TGA. The forward primer contains a 5'-biotin and
a chemically labile linker within the primer; the reverse primer is a standard
oligonucleotide. The mismatch-containing PCR product is treated with KMnO4 as
described above, cleaving the mismatch at T in codon 213. The product is purified
as described in Example 10, and analyzed as described in Example 11. A strong
peak appears at the mass correlating to a product 75 bases in size, identifying that a
T is present in a mismatch in the first position of codon 213. Based on the data from
the analysis in FIG. 14 and FIG. 16 it is possible to confirm that a C->T mutation
has occurred in this p53 sample.

Example 9. Purification of Nonrandom Length Fragments Using Capture Probes.

Nonrandom fragments are purified by annealing to capture probes. The
capture probe or probes consist of a sequence or sequences complementary to the
selected target nonrandom length fragments. One method uses a full length
capture probe prepared as described in Example 3; another uses a number of
chemically synthesized capture probes prepared with biotin covalently attached. For
either method the procedure is identical. A 10 µL sample containing a single full-
length biotinylated capture probe or a mixture of smaller, synthetic, biotinylated
capture probes is mixed with 10 µL of nonrandom fragments in an annealing buffer
consisting of 300 mM NaCl, 10 mM Tris, and 1 mM EDTA pH 7.5. The mixture is
heated in a boiling-H2O bath for 10 min. and then quickly placed in an ice-H2O bath.
The mixture is then transferred to a pre-heated thermal block at 42°C (the
temperature is adjusted depending on the Tm of the capture probe or probes) and
incubated for 1 hour. The solution is then allowed to cool and then mixed with
streptavidin-coated magnetic beads. Binding to the beads takes place according to the
procedure described in Example 3. After the binding step, in place of the alkaline
denaturation step, the bound, hybridized nonrandom fragments are washed with a
volatile buffer such as 1 M NH4HCO3. After 6 cycles of resuspension in 1 M
NH4HCO3, magnetic precipitation, and removal of the supernatant, the beads are
resuspended in 10 µL of deionized H2O and heated to 65°C for 5 min. in order to
release the nonrandom fragments from the bound biotinylated strand. The beads are
quickly precipitated from the warm solution and the supernatant containing the
nonrandom fragments is transferred to another tube. The solution of nonrandom
fragments is dried to remove excess volatile buffer and then analyzed by mass
spectrometry as described in Example 11.

An example of capture and analysis of nonrandom length fragments is shown
in FIG. 17. The source DNA in this case is a section of the coding sequence for the
p53 gene. A 184 base long PCR product is produced as in Example 1, amplifying
p53 from codon 232 to 292 containing a heterozygous point mutation in codon 248,
CGG->CAG. The double-stranded PCR product is digested using the restriction
enzyme Mnl I under conditions described in Example 5. A full length capture probe
of the negative strand is produced as in Example 3, and the nonrandom length
fragments derived from the positive strand are captured and purified as described
above. The purified single-stranded fragments are analyzed as described in Example
11. Shown in FIG. 17 are the 5 single-stranded positive fragments produced from
an Mnl I digest of the wild type 184 base long PCR product. By performing single-
stranded isolation the five similarly sized negative strand fragments are eliminated
from the spectra and all of the fragments are fully resolved.




Shown in FIG. 18 is a magnification of the spectra examining the 26 base long
fragment that, in the heterozygous mutation case, contains the G->A mismatch.
Shown are two clearly resolved peaks with a mass difference of 16 Da, exactly the
difference between G and A and thus confirming the presence of a mutation. The
third smaller peak correlates to a salt adduct of the high mass 26 base product and
emphasizes the need for a process that stringently removes salt prior to analysis.
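
The 16 Da figure can be reproduced from the elemental formulas of the bases. Since the sugar-phosphate backbone is the same for every nucleotide, replacing one base with another shifts the fragment mass by the difference between the free bases. The sketch below uses standard average atomic masses and the textbook formulas for adenine, cytosine, guanine and thymine; these values and the function names are supplied here for illustration and are not taken from the source.

```python
# Average atomic masses in Da (rounded standard values).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}

# Elemental composition of the free bases.
BASE_FORMULA = {
    "A": {"C": 5, "H": 5, "N": 5},          # adenine  C5H5N5
    "C": {"C": 4, "H": 5, "N": 3, "O": 1},  # cytosine C4H5N3O
    "G": {"C": 5, "H": 5, "N": 5, "O": 1},  # guanine  C5H5N5O
    "T": {"C": 5, "H": 6, "N": 2, "O": 2},  # thymine  C5H6N2O2
}


def base_mass(base):
    """Average mass of a free base in Da."""
    return sum(ATOMIC_MASS[el] * n for el, n in BASE_FORMULA[base].items())


def substitution_shift(ref, alt):
    """Mass change of a fragment when base `ref` is replaced by base `alt`."""
    return base_mass(alt) - base_mass(ref)


for ref, alt in [("A", "G"), ("T", "A"), ("T", "C"), ("C", "G")]:
    print(f"{ref}->{alt}: {substitution_shift(ref, alt):+.1f} Da")
```

Guanine differs from adenine by a single oxygen atom, so the G-containing fragment is about 16 Da heavier. The smallest shift among the four bases, A versus T at about 9 Da, sets the mass resolution needed to detect any single-base substitution.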

Example 10. Alternative Purification Method for Mismatch-Specific Nonrandom
Length Fragments.

The purification of nonrandom fragments that were produced by a mutation-
specific cleavage, e.g. chemical cleavage at mismatch sites, can be achieved in an
alternative way. In this case the fragmentation is performed on a PCR product that
has one solid-phase capturable strand, e.g. containing biotin, and that is also able to
be cleaved from the solid support, e.g. a bridging phosphorothioate linkage contained
in the primer region [Mag et al., Nucleic Acids Res. 19(7): 1437-1441 (1991)]. As
an example of this method, a PCR reaction is performed as described in Example 1,
but with one of the primers containing a 5'-end biotin modification and also a
bridging phosphorothioate linkage located 3-5 bases from the 3'-end, and the other
primer a normal one. After amplification the PCR product is subjected to a mutation-
specific fragmentation method directly since, for heterozygous mutations, mismatch-
containing heteroduplexes are formed in situ during the PCR. In order to check for
the possibility of a homozygous mutation, the sample is mixed with an equal amount
of wild type control, annealed and then subjected to the fragmentation reaction. The
material recovered from the fragmentation reactions is purified and made single-
stranded by the method described in Example 3. In this case, after the denaturing
step, the products are released from the magnetic beads after several H2O washes by
treatment with 5 µL of 0.02 mM AgNO3 and incubating at 45°C for 15 min. The
Ag+ ions are sequestered by the addition of 1 µL of 100 mM DTT. The samples
are dried to remove excess DTT and then analyzed by mass spectrometry by the
method described in Example 11.




Example 11. Mass Spectrometry Analysis. 

The nucleic acid sample to be analyzed is typically mixed with an equal
volume of matrix solution consisting of 0.5 M 3-hydroxypicolinic acid (3-HPA) and
50 mM diammonium hydrogen citrate. Typically a 1 µL portion of the sample is
applied to the mass spectrometer sample stage and allowed to dry under a gentle
stream of nitrogen gas at room temperature. When the sample has completely dried
to form crystals (typically 5 min.) the sample is inserted into the mass spectrometer
for analysis. The usual analysis conditions employ the use of a Nd:YAG laser
operating at 266 nm with an average pulse energy of 50 mJ/cm2. An average of 100
laser shots is typically used to obtain a spectrum.

All publications and patent applications mentioned in this specification are
herein incorporated by reference to the same extent as if each individual publication
or patent application was specifically and individually indicated to be incorporated by
reference.

The invention now being fully described, it will be apparent to one of ordinary
skill in the art that many changes and modifications can be made thereto without
departing from the spirit or scope of the invention and the appended claims.






SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: MONFORTE, JOSEPH A.
               SHALER, THOMAS A.
               TAN, YUPING
               BECKER, CHRISTOPHER H.

(ii) TITLE OF INVENTION: METHODS OF SCREENING
                         NUCLEIC ACIDS USING MASS
                         SPECTROMETRY

(iii) NUMBER OF SEQUENCES: 3

(iv) CORRESPONDENCE ADDRESS:
     (A) ADDRESSEE: COOLEY GODWARD LLP
     (B) STREET: FIVE PALO ALTO SQUARE
                 3000 EL CAMINO REAL
     (C) CITY: PALO ALTO
     (D) STATE: CALIFORNIA
     (E) COUNTRY: USA
     (F) ZIP: 94306

(v) COMPUTER READABLE FORM:
     (A) MEDIUM TYPE: Floppy disk
     (B) COMPUTER: IBM PC compatible
     (C) OPERATING SYSTEM: PC-DOS/MS-DOS
     (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
     (A) APPLICATION NUMBER:
     (B) FILING DATE: MARCH 4, 1997
     (C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
     (A) APPLICATION NUMBER: 60/012,752
     (B) FILING DATE: MARCH 4, 1996

(viii) ATTORNEY/AGENT INFORMATION:
     (A) NAME: JACKIE N. NAKAMURA
     (B) REGISTRATION NUMBER: 35,966
     (C) REFERENCE/DOCKET NUMBER: GNTR-OOl/.OlWO

(ix) TELECOMMUNICATION INFORMATION:
     (A) TELEPHONE: 415-843-5214
     (B) TELEFAX: 414-857-0663




(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 10 nucleotides
     (B) TYPE: nucleic acid
     (C) STRANDEDNESS: single
     (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA, RNA

(iii) HYPOTHETICAL: YES

(iv) ANTI-SENSE: NO

(v) FEATURE:
     (A) NAME/KEY:
     (B) LOCATION:

(vi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
     NNNGGCCNNN

(3) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 10 nucleotides
     (B) TYPE: nucleic acid
     (C) STRANDEDNESS: single
     (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA, RNA

(iii) HYPOTHETICAL: YES

(iv) ANTI-SENSE: NO

(v) FEATURE:
     (A) NAME/KEY:
     (B) LOCATION:

(vi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
     GACGGCCAAA



(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 10 nucleotides
     (B) TYPE: nucleic acid
     (C) STRANDEDNESS: single
     (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA, RNA

(iii) HYPOTHETICAL: YES

(iv) ANTI-SENSE: NO

(v) FEATURE:
     (A) NAME/KEY:
     (B) LOCATION:

(vi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
     TTTGGCCGTC




WE CLAIM 

1. A method of detecting mutations in a target nucleic acid comprising: 

obtaining from said target nucleic acid a set of nonrandom length fragments 
(NLFs) in single-stranded form, wherein said set comprises NLFs 
derived from one of either the positive or the negative strand of said 
target nucleic acid or said set is a subset of single-stranded NLFs 
derived from both the positive and the negative strand of said target
nucleic acid, 

determining masses of the members of said set using mass spectrometry. 

2. The method of claim 1 wherein at least one member of said set of single- 
stranded NLFs optionally has one or more nucleotides replaced with mass-modified 
nucleotides. 

3. The method of claim 2 wherein said determining step optionally further
comprises

utilizing internal self-calibrants to provide improved mass accuracy.

4. The method of claim 3 wherein said target nucleic acid is single-stranded and
said obtaining step further comprises:

hybridizing said single-stranded target nucleic acid to one or more sets of
fragmenting probes to form hybrid target nucleic acid/fragmenting
probe complexes comprising at least one double-stranded region and
at least one single-stranded region,

nonrandomly fragmenting said target nucleic acid by cleaving said hybrid
target nucleic acid/fragmenting probe complexes at every single-
stranded region with at least one single-strand-specific cleaving reagent
to form a set of NLFs.

5. The method of claim 4 wherein said set of fragmenting probes leaves single-
stranded gaps between double-stranded regions formed by hybridization of said set
of fragmenting probes to said target nucleic acid.




6. The method of claim 5 wherein said hybridizing step further comprises:

providing two sets of single-stranded target nucleic acid and

separately hybridizing a first set of fragmenting probes to a first set of single-
stranded target nucleic acid and a second set of fragmenting probes to
a second set of single-stranded target nucleic acid, wherein said
members of said second set of fragmenting probes comprise at least
one single-stranded nucleotide sequence complementary to regions of
said target nucleic acid that are not complementary to any nucleotide
sequences in any members of said first set of fragmenting probes.

7. The method of claim 6 wherein said members of said first set of fragmenting
probes comprise nucleotide sequences that overlap with nucleotide sequences of said
members of said second set of fragmenting probes.

8. The method of claim 4 wherein said single-strand-specific cleaving reagent is
a single-strand-specific endonuclease.

9. The method of claim 4 wherein said single-strand-specific cleaving reagents 
are single-strand specific chemical cleaving reagents. 

10. The method of claim 9 wherein said single-strand specific chemical cleaving
reagents are selected from the group consisting of hydroxylamine, hydrogen peroxide,
osmium tetroxide, and potassium permanganate.

11. The method of claim 4 further comprising after said nonrandomly fragmenting
step:

hybridizing one or more of said NLFs to one or more capture probes, wherein
said capture probes comprise a single-stranded region complementary
to at least one of said NLFs and a first binding moiety,

binding said first binding moiety to a second binding moiety attached to a
solid support, wherein said binding occurs either before or after said
hybridizing of said NLFs to one or more capture probes, isolating a set
of single-stranded NLFs.

12. The method of claim 4 wherein said fragmenting probes comprise a single-
stranded nucleotide sequence and a first binding moiety, further comprising:

after said nonrandomly fragmenting step, binding said first binding moiety to
a second binding moiety attached to a solid support, and

isolating said set of single-stranded NLFs.

13. The method of claim 3 wherein said obtaining step further comprises:

nonrandomly fragmenting said target nucleic acid with one or more restriction
endonucleases to form a set of NLFs, hybridizing one or more of said
set of NLFs or a subset thereof to one or more oligonucleotide probes,
wherein each of said oligonucleotide probes comprises a nucleic acid
comprising a single-stranded region and a first binding moiety, binding
said first binding moiety to a second binding moiety attached to a solid
support either before or after said hybridizing step, and isolating said
set or subset of single-stranded NLFs.

14. The method of claim 13 wherein all of said oligonucleotide probes consist of
one of either full-length positive or full-length negative single strands of said target
nucleic acid and a first binding moiety.

15. The method of claim 13 wherein said binding between said first binding
moiety and said second binding moiety is a covalent attachment.

16. The method of claim 13 wherein one binding moiety is a member selected
from the group consisting of an antibody, a hormone, an inhibitor, a co-factor
portion, a binding ligand, and a polynucleotide sequence, and the other binding
moiety is a corresponding member selected from the group consisting of an antigen
capable of recognizing said antibody, a receptor capable of recognizing said hormone,
an enzyme capable of recognizing said inhibitor, a co-factor enzyme binding site
capable of recognizing said co-factor portion, a substrate capable of recognizing said
binding ligand, and a complementary polynucleotide sequence.

17. The method of claim 13 wherein said isolating further comprises:

washing said set of NLFs bound to said solid support with a solution
comprising volatile salts selected from the group consisting of
ammonium bicarbonate, dimethyl ammonium bicarbonate and trimethyl
ammonium bicarbonate.

18. The method of claim 3 wherein said target nucleic acid is single-stranded and
wherein said obtaining step further comprises:

hybridizing said single-stranded target nucleic acid to one or more restriction
site probes to form hybridized target nucleic acids having double-
stranded regions where said restriction site probes have hybridized to
said single-stranded target nucleic acid and at least one single-stranded
region, nonrandomly fragmenting said hybridized target nucleic acids
using one or more restriction endonucleases that cleave at restriction
sites within said double-stranded regions.

19. The method of claim 18 further comprising after said nonrandomly
fragmenting step,

hybridizing said NLFs to one or more capture probes, wherein said capture
probes comprise a single-stranded region complementary to at least one
of said NLFs and a first binding moiety, binding said first binding
moiety to a second binding moiety attached to a solid support, wherein
said binding occurs either before or after said hybridizing of said
NLFs to one or more capture probes, isolating a set of single-stranded
NLFs.



20. The method of claim 19 wherein said cleaved restriction site probes comprise
a single-stranded region complementary to half of a restriction endonuclease site and
a first binding moiety, and further comprising after said nonrandomly fragmenting
step, binding said first binding moiety to a second binding moiety attached to a solid
support, and isolating a set of single-stranded NLFs.



21. The method of claim 3 wherein said target nucleic acid is single-stranded and
said obtaining step further comprises:

providing conditions permitting folding of said single-stranded target nucleic
acid to form a three-dimensional structure having intramolecular
secondary and tertiary interactions,

nonrandomly fragmenting said folded target nucleic acid with at least one
structure-specific endonuclease to form a set of single-stranded NLFs,

modifying either said target nucleic acid or said set of single-stranded NLFs
such that members of said set of single-stranded NLFs comprise a single-stranded
nucleotide sequence and at least one first binding moiety,

binding said first binding moiety to a second binding moiety attached to a solid
support, and

isolating said set of single-stranded NLFs.

22. The method of claim 3 wherein said target nucleic acid is single-stranded and
said obtaining step further comprises:

providing conditions permitting folding of said single-stranded target nucleic
acid to form a three-dimensional structure having intramolecular
secondary and tertiary interactions,

nonrandomly fragmenting said folded target nucleic acid with at least one
structure-specific endonuclease to form a set of single-stranded NLFs,

hybridizing one or more of said set of NLFs to one or more capture probes,
wherein said capture probes comprise a single-stranded nucleotide
sequence and a first binding moiety,

binding said first binding moiety to a second binding moiety attached to a solid
support either before or after said hybridizing step, and

isolating a set of single-stranded NLFs.




23. The method of claim 21 wherein said isolated set of single-stranded NLFs
comprise any NLFs having a 5' end of said target nucleic acid.

24. The method of claim 22 wherein said isolated set of single-stranded NLFs
comprise any NLFs having a 5' end of said target nucleic acid.

25. The method of claim 21 wherein said structure-specific endonuclease is
selected from the group consisting of:

T4 endonuclease VII, RuvC, MutY, and the endonucleolytic activity from the
5'-3' exonuclease subunit of thermostable polymerases.

26. The method of claim 3 wherein said target nucleic acid is single-stranded and
wherein said obtaining step further comprises:

hybridizing said single-stranded target nucleic acid to one or more wild type
probes,

nonrandomly fragmenting said target nucleic acid with one or more mutation-
specific cleaving reagents that specifically cleave at any regions of
nucleotide mismatch that form between said target nucleic acid and any
of said wild type probes.

27. The method of claim 26 wherein said nonrandomly fragmenting step further
comprises:

digesting said first set of nonrandom length fragments with one or more
restriction endonucleases or

cleaving said first set of nonrandom length fragments with one or more single-
strand-specific cleaving reagents.




28. The method of claim 26 wherein members of said set of single-stranded NLFs
comprise a single-stranded region and at least one first binding moiety, further
comprising after said nonrandomly fragmenting step, binding said first binding moiety
to a second binding moiety attached to a solid support, and isolating a set of single-
stranded NLFs.

29. The method of claim 26 wherein said obtaining step further comprises:

hybridizing members of said set of NLFs to one or more capture probes,
wherein said capture probes comprise a single-stranded nucleotide
sequence and at least one first binding moiety, binding said first
binding moiety to a second binding moiety attached to a solid support,
and isolating a set of single-stranded NLFs.

30. The method of claim 26 wherein said obtaining step further comprises:
isolating a set of single-stranded NLFs comprising any NLFs having a 5' end
of said target nucleic acid.

31. A method of detecting mutations in a target nucleic acid comprising:
nonrandomly fragmenting said target nucleic acid with one or more restriction
endonucleases to form a set of double-stranded NLFs, wherein said
nonrandomly fragmenting further comprises using volatile salts in a
restriction buffer, determining masses of the members of the set of
double-stranded NLFs, wherein said determining does not involve
sequencing of said target nucleic acid.

32. A method of detecting mutations in a double-stranded target nucleic acid
comprising:

nonrandomly fragmenting said target nucleic acid using one or more
restriction endonucleases to form a first set of nonrandom length
fragments (NLFs),

hybridizing members of said first set of NLFs to a set of wild type
probes,

nonrandomly fragmenting one or more members of said set of NLFs
with one or more mutation-specific cleaving reagents that
specifically cleave at any regions of nucleotide
mismatch that form between members of said first set
of NLFs and complementary members of said set of
wild type probes, wherein said nonrandomly
fragmenting step forms a second set of NLFs, and

determining masses of members of said second set of NLFs using mass
spectrometry, wherein said determining does not require sequencing of
said target nucleic acid.

33. The method of claim 32 further comprising

obtaining said set of wild type probes by nonrandomly fragmenting a wild
type target nucleic acid using the same restriction endonucleases used
to form said first set of NLFs.

34. The method of claim 33 wherein said steps of nonrandomly fragmenting of
said target nucleic acid and obtaining said set of wild type fragmenting probes are
performed simultaneously in a single solution.

35. The method of claim 32 further comprising before said determining step,
isolating said second set of NLFs wherein said members of said second set
comprise double-stranded nucleotide sequences and a first binding
moiety, and binding said first binding moiety to a second binding
moiety attached to a solid support.

36. The method of claim 32 further comprising before said determining step, 
isolating said second set of NLFs wherein said isolating comprises hybridizing 

members of said second set of NLFs to one or more capture probes, 
wherein said capture probes comprise a single-stranded nucleotide 
sequence and a first binding moiety, binding said first binding moiety 
to a second binding moiety attached to a solid support. 




37. A method of detecting mutations in a target nucleic acid comprising:

nonrandomly fragmenting said target nucleic acid, using a solution comprising
one or more volatile salts to form a set of nonrandom length fragments
(NLFs),

determining masses of members of said set of NLFs using mass spectrometry,
wherein said determining does not involve sequencing of said target
nucleic acid.
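
The mass-determining step recited in the claims above can be illustrated with a short sketch. The residue masses and the terminal correction are standard average values for unmodified linear DNA, not figures taken from this application; the function names, the 3 Da tolerance and the example peak list are hypothetical, and the nine-base sequences are the sequence windows printed in Fig. 4A.

```python
# Average masses (Da) of deoxynucleotide monophosphate residues within a DNA chain.
# Standard reference values, not taken from this application.
RESIDUE_MASS = {"A": 313.21, "C": 289.18, "G": 329.21, "T": 304.20}

def ssdna_mass(seq, five_prime_phosphate=False):
    """Approximate average mass of a linear single-stranded DNA fragment."""
    mass = sum(RESIDUE_MASS[base] for base in seq.upper())
    mass -= 61.96  # correction for free 5'-OH and 3'-OH termini
    if five_prime_phosphate:
        mass += 79.98  # HPO3 added when the fragment keeps a 5' phosphate
    return round(mass, 2)

def unmatched_fragments(expected, observed_peaks, tol=3.0):
    """Names of expected fragments with no observed peak within tol Da."""
    return [name for name, mass in expected.items()
            if not any(abs(mass - peak) <= tol for peak in observed_peaks)]

# Hypothetical comparison: the wild type mass is expected, but the only
# observed peak comes from the A-to-T mutant sequence.
expected = {"wt_fragment": ssdna_mass("GCACAAGCC")}
observed = [ssdna_mass("GCACTAGCC")]
print(unmatched_fragments(expected, observed))
```

An expected fragment that finds no matching peak is a candidate site of mutation; no sequencing of the target is involved, only a comparison of masses.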

38. A method of decreasing background noise comprising
obtaining a sample to be analyzed by a mass spectrometer,
washing said sample with a solution of volatile salts, and
evaporating the solution of volatile salts from the sample.

39. A method of obtaining nonrandom length fragments from a target nucleic acid
comprising:

hybridizing one or more sets of fragmenting probes to said target nucleic acid
to form a set of hybrids,
cleaving single-stranded regions of members of said set of hybrids.

40. A kit for detecting mutations in one or more target nucleic acids in a sample
comprising:

(a) one or more sets of fragmenting probes, wherein said fragmenting
probes are complementary to a sequence of one or more of said target
nucleic acids;

(b) a single-strand specific cleaving reagent; and

(c) a solid support capable of isolating said single-stranded target nucleic
acids that have been nonrandomly fragmented into single-stranded
nonrandom length fragments.

41. The kit of claim 40, wherein said single-strand specific cleaving reagent is a
single-strand-specific chemical cleaving reagent selected from the group consisting of
hydroxylamine, hydrogen peroxide, osmium tetroxide, and potassium permanganate.

42. The kit of claim 40, wherein said single-strand specific cleaving reagent is a
nuclease selected from the group consisting of mung bean nuclease, nuclease S1, and
RNase A.

43. A kit for detecting mutations in one or more target nucleic acids in a sample
comprising:

(a) one or more sets of restriction site probes, wherein said probes
comprise a single-stranded sequence capable of hybridizing to a
sequence of said one or more target nucleic acids;

(b) one or more restriction endonucleases that cleave at restriction sites
within said restriction site probes; and

(c) a solid support capable of isolating said single-stranded target nucleic
acids that have been nonrandomly fragmented into single-stranded
nonrandom length fragments.

44. The kit of claim 43, wherein said restriction endonuclease is a Class IIS
restriction endonuclease.



45. The kit of claim 43, wherein said restriction site probe comprises two regions,
a first region that is single-stranded and complementary to a specific sequence within
said target nucleic acid, and a second region that is double-stranded and contains a
restriction recognition site for a Class IIS restriction endonuclease.



Fig. 1A. Mass spectrum (m/z axis, 18000 to 26000).

Fig. 1B. Mass spectrum (m/z axis, 24000 to 36000).



Fig. 2. Flow diagram. A source nucleic acid is PCR amplified with a top primer and a bottom primer to give a 161-base-pair target nucleic acid. (Number indicates fragment length; + or - indicates positive or negative strand.) Treat with restriction endonucleases: positive-strand fragments 34+, 19+, 32+, 21+, 28+, 27+; negative-strand fragments 30-, 19-, 32-, 29-, 20-, 31-. Purify and mass analyze single-stranded nonrandom length nucleic acid fragments. Schematic spectrum (mass axis, 6000 to 12000) with peaks labeled by fragment, including 19-, 19+, 20-, 27+, 28+, 29-, 30-, 31-, 32+, 32- and 34+.
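
The fragment labels in Fig. 2 can be checked with simple arithmetic: on each strand the labeled lengths should add up to the 161-nucleotide target. The sketch below is only an illustration; the helper name is hypothetical, and the cut coordinates are an assumption, derived here as running sums of the labeled lengths rather than read from the application.

```python
from itertools import accumulate

def fragment_lengths(total_length, cut_positions):
    """Lengths of the pieces left when a strand is cut after each listed position."""
    bounds = [0] + sorted(cut_positions) + [total_length]
    return [right - left for left, right in zip(bounds, bounds[1:])]

# Fragment labels from the figure, read left to right along each strand.
PLUS_STRAND = [34, 19, 32, 21, 28, 27]
MINUS_STRAND = [30, 19, 32, 29, 20, 31]

# Assumed cut coordinates: running sums of the labeled lengths
# (the final sum is the strand end, so it is dropped).
plus_cuts = list(accumulate(PLUS_STRAND))[:-1]
minus_cuts = list(accumulate(MINUS_STRAND))[:-1]

print(plus_cuts, fragment_lengths(161, plus_cuts))
print(minus_cuts, fragment_lengths(161, minus_cuts))
```

Because the cuts on the two strands fall at different coordinates, most fragments differ in length between strands, which is what lets the single-stranded fragments resolve as separate peaks.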



Fig. 3. Heterozygous mix of wild type and mutant (A to T transversion) target nucleic acid. Fragment labels: 34+, 19+, 32+, 21+, 28+, 27+ and 30-, 19-, 32-, 29-, 20-, 31-; the mutation lies in the 32+/32- fragment. Purify and mass analyze single-stranded nonrandom length nucleic acid fragments. Schematic spectrum (mass axis, 6000 to 12000) with separate peaks for 32+ (Mut), 32+ (Wt), 32- (Wt) and 32- (Mut), alongside 27+, 28+, 29-, 30-, 31- and 34+.



Fig. 4A. Schematic mass spectrum (mass axis, 4700 to 5000) of a wild type fragment (...GCACAAGCC...) and a mutant fragment (...GCACTAGCC...).

Fig. 4B. Schematic mass spectrum (mass axis, 4700 to 5000) of a wild type fragment (...GCACAAGCC...) and a mutant fragment (...GCAC(Br)dUAGCC...).
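
The contrast between Figs. 4A and 4B comes down to the mass difference at the single substituted position. The sketch below estimates that difference under stated assumptions: the residue and group masses are standard average values rather than numbers from this application, 5-bromo-deoxyuridine is modeled as thymidine with its methyl group replaced by bromine, and the function name is hypothetical.

```python
# Average residue masses (Da) within a DNA chain; standard reference values.
DA = 313.21  # deoxyadenosine monophosphate residue
DT = 304.20  # thymidine monophosphate residue

# Average group masses (Da) used to model the modified base.
CH3 = 15.035
BR = 79.904

# Assumption: 5-bromo-dU residue = thymidine residue with CH3 replaced by Br.
BRDU = DT - CH3 + BR

def substitution_shift(wild_type_residue, mutant_residue):
    """Mass change (Da) of a fragment when one residue replaces another."""
    return round(mutant_residue - wild_type_residue, 2)

print("A -> T     :", substitution_shift(DA, DT))
print("A -> (Br)dU:", substitution_shift(DA, BRDU))
```

Under these assumptions the unmodified A to T change shifts the fragment by roughly 9 Da, while the brominated analog shifts it by roughly 56 Da, which is why the two peaks in Fig. 4B are drawn further apart than in Fig. 4A.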



Fig. 5. Heterozygous mix (positive strand only) of wild type and mutant (A to T transversion) target nucleic acid. Fragment labels: 34+, 19+, 32+, 21+, 28+, 27+. Purify and mass analyze single-stranded nonrandom length nucleic acid fragments. Schematic spectrum (mass axis, 6000 to 12000) with peaks 19+, 21+, 27+, 28+, 32+ (Mut), 32+ (Wt) and 34+.



Fig. 6. Single-stranded nucleic acid target (161+) carrying a mutation. Hybridize restriction site probes to form double-stranded restriction sites. Fragment target nucleic acid using restriction endonucleases (34+, 19+, 32+, 21+, 28+, 27+). (1) Purify nonrandom length nucleic acid fragments. (2) Analyze by mass spectrometry. Standard expected spectra (mass axis, 7000 to 12000): 19+, 21+, 27+, 28+, 32+ (Mut), 34+.



Fig. 7A. Top primer, source nucleic acid, bottom primer. Amplify, yield + strand product only: 161+ single-stranded target nucleic acid. Sets of oligonucleotide probes complementary to the 161+ target nucleic acid: 20-, 22-, 24-, 26-, 28-, 30-, 32-, 34-. Hybridize oligonucleotide probes to target nucleic acid; a mixture of hybrids is formed. Digest with single-strand-specific endonuclease (or ss specific chemical treatment) to give nonrandom length nucleic acid fragments (20+, 22+, 24+, 26+, 28+, 30+, 32+, 34+).



Fig. 7B. Isolate nonrandom length nucleic acid fragments from oligonucleotide probes and analyze by mass spectrometry. Schematic spectrum (mass axis, 4000 to 11000) with peaks 20+, 22+, 24+, 26+, 28+, 30+, 32+ and 34+.



Fig. 8. Single-stranded target nucleic acid with cut sites at C residues. Add fragmenting probes complementary to most of the target molecule, leaving a few gaps with individual C's exposed as single stranded. Fragment at single-stranded C sites. Selectively isolate nonrandom length fragments. Analyze by mass spectrometry.



Fig. 9A. Top primer, source nucleic acid, bottom primer. Amplify, yield + strand product only: 161+ mutant single-stranded target nucleic acid. Sets of fragmenting probes complementary to the 161+ target nucleic acid (20-, 22-, 24-, 28-, 30-, 32-, 34-). Hybridize oligonucleotide probes to target nucleic acid; a heterozygous mutant/wild type T·T mismatch forms. Digest with single-strand-specific endonuclease (or ss specific chemical treatment).



Fig. 9B. Cleave site-specifically at the location of the base mismatch (can occur simultaneous to primary digest); fragments with mismatch undergo site-specific cleavage. Isolate nonrandom length fragments from oligonucleotide probes and analyze by mass spectrometry. Schematic spectrum (mass axis, 2000 to 10000) with peaks including 20+, 22+, 24+, 28+, 30+ and 34+, plus cleavage products marked (Mut).



Fig. 10A. Heterozygous mix of (A) wild type target nucleic acid and (B) mutant target nucleic acid (A to T transversion), each 161 base pairs. Fragment target nucleic acid using restriction endonucleases to give (A) wild type nonrandom length fragments (NLF) and (B) mutant NLF (A to T transversion), with fragment labels 34+, 19+, 32+, 21+, 28+, 27+ and 30-, 19-, 32-, 29-, 20-, 31-. Denature/anneal (produces a mixture of species A, B, C and D), where (C) is the wild type/mutant heterozygous NLF (A·A mismatch) and (D) is the mutant/wild type heterozygous NLF (T·T mismatch).



Fig. 10B. Mutation-specific cleavage at the location of the base mismatch (affects species (C) and (D) only). In (C), the wild type/mutant heterozygous NLF (A·A mismatch), and (D), the mutant/wild type heterozygous NLF (T·T mismatch), the 32+ fragment is cut into 8+ and 24+ and the 32- fragment into 11- and 21-. Schematic spectrum (mass axis, 3000 to 11000) with peaks including 8+ (Mut), 8+ (Wt), 11- (Mut and Wt, identical), 21- (Wt), 21- (Mut) and 24+ (Mut and Wt, identical).
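
The sub-fragment labels in Fig. 10B also indicate roughly where the mismatch sits. The sketch below, with hypothetical helper names, checks that each cleaved pair adds back up to its 32-nucleotide parent and converts the labels into a coordinate measured from the left end of the target as drawn. It assumes the cleaving reagent cuts at, or within a nucleotide or so of, the mismatch.

```python
def split_fragment(parent_length, cut_offset):
    """Lengths of the two pieces produced by one cut inside a fragment."""
    if not 0 < cut_offset < parent_length:
        raise ValueError("cut must fall strictly inside the fragment")
    return cut_offset, parent_length - cut_offset

def cut_coordinate(upstream_lengths, cut_offset):
    """Distance from the left end of the strand, as drawn, to the cut."""
    return sum(upstream_lengths) + cut_offset

print(split_fragment(32, 8))          # (8, 24): the 8+ and 24+ pieces
print(split_fragment(32, 11))         # (11, 21): the 11- and 21- pieces
print(cut_coordinate([34, 19], 8))    # 61, from the positive-strand labels
print(cut_coordinate([30, 19], 11))   # 60, from the negative-strand labels
```

The two strands place the cut within one nucleotide of each other, so either pair of new peaks would be enough to localize the mutation on the 161-nucleotide target.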



Fig. 11. Mutant target nucleic acid (A to T transversion), 161 base pairs. Fragment target nucleic acid using restriction endonucleases. Denature/anneal to form heterozygotes. Site-specifically cleave at locations of base mismatch. Isolate only mutant strand (mutant + strand NLF: 34+, 19+, 8+, 24+, 21+, 28+, 27+, with the mismatch cut site between 8+ and 24+). Analyze by mass spectrometry. Schematic spectrum (mass axis, 3000 to 11000) with peaks including 19+, 21+, 24+, 27+, 28+ and 34+.



Fig. 12. Single-stranded nucleic acid target (161+). Denature/anneal to form thermodynamically favored secondary/tertiary structure. Fragment using a structure-specific endonuclease at the cut sites (fragments 12+, 39+, 19+, 17+, 48+, 26+). (1) Purify nonrandom length fragments. (2) Analyze by mass spectrometry.



Fig. 13A. Tube (1): make capture probe using biotinylated primer during amplification of target (161). Capture to streptavidin-coated solid phase support, denature to release unbound strand, wash (bound strand: 161-). Tube (2): amplify target nucleic acid (161, carrying the mutation) and fragment using restriction enzymes; fragment target nucleic acid using restriction endonucleases (34+ and 30-, 19-, 32-, 29-, 20-, 31- labeled). Mix contents of Tubes (1) and (2), denature/anneal strand of fragmented target nucleic acid to solid-phase-bound capture probe (34+, 19+, 32+, 21+, 28+, 27+ on 161-).



Fig. 13B. (Optional) Cleave site-specifically at the location of any loop or mismatch using targeted endonuclease (34+, 19+, 8+, 24+, 21+, 28+, 27+; the mutation-specific cut divides the capture strand into 60- and 101-). (1) Wash solid phase bound products to remove any unbound DNA and all contaminants. (2) Release single-stranded fragments by denaturation of the bound duplex. (3) Analyze by mass spectrometry. Standard expected spectra (mass axis, 7000 to 12000): 19+, 21+, 27+, 28+, 32+ (Mut), 34+. Expected spectra with optional mutation-specific cutting (mass axis, 7000 to 12000): 21+, 24+ (Mut), 27+, 28+.



Fig. 14. (No text recoverable from the drawing.)

Fig. 15. Spectrum; axis label: Signal.

Fig. 16. Spectrum; axis label: Signal.






INTERNATIONAL SEARCH REPORT

International Application No: PCT/US 97/03499

A. CLASSIFICATION OF SUBJECT MATTER

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols): IPC 6 C12Q

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practical, search terms used)

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category: Y
Citation: ANALYTICAL CHEMISTRY, vol. 66, no. 10, 15 May 1994, pages 1637-1645, XP000579973, WU K J ET AL: 'TIME-OF-FLIGHT MASS SPECTROMETRY OF UNDERIVATIZED SINGLE-STRANDED DNA OLIGOMERS BY MATRIX-ASSISTED LASER DESORPTION'; see the whole document
Relevant to claim No.: 1-45

Category: Y
Citation: WO 95 07361 A (PASTEUR INSTITUT; INST NAT SANTE RECH MED (FR); MEO TOMMASO (FR)) 16 March 1995; see the whole document
Relevant to claim No.: 1-41, 43-45

Category: Y
Citation: WO 91 15600 A (HOPE CITY) 17 October 1991; see the whole document
Relevant to claim No.: 42

[X] Further documents are listed in the continuation of box C.
[X] Patent family members are listed in annex.

Date of the actual completion of the international search: 9 July 1997

Name and mailing address of the ISA: European Patent Office, P.B. 5818 Patentlaan 2, NL-2280 HV Rijswijk

Authorized officer: Müller, F

Form PCT/ISA/210 (second sheet) (July 1992)

page 1 of 2





INTERNATIONAL SEARCH REPORT

International Application No: PCT/US 97/03499

C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Category: P,X
Citation: WO 96 32504 A (UNIV BOSTON) 17 October 1996; see abstract and claims
Relevant to claim No.: 1-45

Citation: WO 96 29431 A (SEQUENOM INC) 26 September 1996; see whole document, esp. claim 48
Relevant to claim No.: 1-8, 11-40, 43-45

page 2 of 2



INTERNATIONAL SEARCH REPORT

International Application No: PCT/US 97/03499

Patent document cited in search report | Publication date | Patent family member(s) | Publication date

WO 9507361 A | 16-03-95 | FR 2709761 A | 17-03-95
             |          | CA 2171469 A | 16-03-95
             |          | EP 0717781 A | 26-06-96

WO 9115600 A | 17-10-91 | AU 7762691 A | 30-10-91

WO 9632504 A | 17-10-96 | AU 5544696 A | 30-10-96

WO 9629431 A | 26-09-96 | US 5605798 A | 25-02-97
             |          | AU 5365196 A | 08-10-96

Form PCT/ISA/210 (patent family annex) (July 1992)






